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Background of the Invention 

Functional analysis of cellular proteins is greatly facilitated through changes in the 
expression level of the corresponding gene for subsequent analysis of the accompanying 
phenotype. For this approach, an inducible expression system controlled by an external 

20 stimulus is desirable. Ideally such a system would not only mediate an M on/ofF f status for 
gene expression but would also permit limited expression of a gene at a defined level. 

Attempts to control gene activity have been made using various inducible eukaryotic 
promoters, such as those responsive to heavy metal ions (Mayo et al. (1982) Cell 29:99-108: 
Brinster et al. (1982) Nature 226:39-42; Searle et al. (1985) Mol Cell Biol 1:1480-1489), 

25 heat shock (Nouer et al. (1991) in Heat Shock Response, e.d. Nouer, L. , CRC, Boca Raton , 
FL, ppl67-220) or hormones (Lee et al. (1981) Nature 224:228-232; Hynes et al. (1981) 
Proc. Natl Acad Scl USA 2£:203 8-2042; Klock et al. (1987) Nature 322:734-736; Israel & 
Kaufman (1989) Nucl Acids Res. 12:2589-2604). However, these systems have generally 
suffered from one or both of the following problems: (1) the inducer (e,g, heavy metal ions, 

30 heat shock or steroid hormones) evokes pleiotropic effects, which can complicate analyses, 
and (2) many promoter systems exhibit high levels of basal activity in the non-induced state, 
which prevents shut-off the regulated gene and results in modest induction factors. 

An approach to circumventing these limitations is to introduce regulatory elements 
from evolutionarily distant species such as E.coli into higher eukaryotic cells with the 

35 anticipation that effectors which modulate such regulatory circuits will be inert to eukaryotic 
cellular physiology and, consequently, will not elicit pleiotropic effects in eukaryotic cells. 
For example, the Lac repressor (lacR)/operator/inducer system of E. coli functions in 
eukaryotic cells and has been used to regulate gene expression by three different approaches: 
(1) prevention of transcription initiation by properly placed lac operators at promoter sites 




(Hu & Davidson (1987) CW/4&555-566; Brown et al. (1987) Cell 42:603-6 12; Figge et al. 
(1988) Cell 52:713-722; Fuerst et al. (1989) Proc. Natl. Acad ScL USA £6:2549-2553: 
Deuschle et al. (1989) Proc. Natl Acad. ScL USA £6:5400-5405); (2) blockage of 
transcribing RNA polymerase II during elongation by a LacR/operator complex (Deuschle et 
5 al. (1990) Science 24&480-483); and (3) activation of a promoter responsive to a fusion 
between LacR and the activation domain of herpes simples virus (HSV) virion protein 16 
(VP16)<Labow et al. (1990) Mol Cell Biol I&3343-3356; Bairn et al. (1991) Proc. Natl 
Acad ScL USA ££:5072-5076). 

In one version of the Lac system, expression of lac operator-linked sequences is 

10 constitutively activated by a LacR-VP16 fusion protein and is turned off in the presence of 
isopropyl-P-D-thiogalactopyranoside (IPTG) (Labow et al. (1990), cited supra). In another 
version of the system, a lacR-VP16 variant is used which binds to lac operators in the 
presence of IPTG, which can be enhanced by increasing the temperature of the cells (Bairn et 
al.(1991), cited supra). The utility of these lac systems in eukaryotic cells is limited, in part, 

1 5 because IPTG acts slowly and inefficiently in eukaryotic cells and must be used at 

concentrations which approach cytotoxic levels. Alternatively, use of a temperature shift to 
induce gene expression is likely to elicit pleiotropic effects in the cells. Thus, there is a need 
for a more efficient inducible regulatory system which exhibits rapid and high level induction 
of gene expression and in which the inducer is tolerated by eukaryotic cells without 

20 cytoxicity or pleiotropic effects. 

Components of the tetracycline (Tc) resistance system of E. coli have also been 
found to function in eukaryotic cells and have been used to regulate gene expression. For 
example, the Tet repressor (TetR), which binds to tet operator sequences in the absence of 
tetracycline and represses gene transcription, has been expressed in plant cells at sufficiently 

25 high concentrations to repress transcription from a promoter containing tet operator 

sequences (Gatz, C. et al. (1992) Plant J. 2:397-404). However, very high intracellular 
concentrations of TetR are necessary to keep gene expression down-regulated in cells, which 
may not be achievable in many situations, thus leading to "leakiness" in the system. 

In other studies, TetR has been fused to the activation domain of VP 16 to create a 

30 tetracycline-controlled transcriptional activator (tTA) (Gossen, M. and Bujard, H. (1992) 
Proc. Natl Acad. ScL USA £2:5547-5551). The tTA fusion protein is regulated by 
tetracycline in the same manner as TetR, i.e., tTA binds to tet operator sequences in the 
absence of tetracycline but not in the presence of tetracycline. Thus, in this system, in the 
continuous presence of Tc, gene expression is kept off, and to induce transcription, Tc is 

35 removed. 



gpmpiary of the Invention 

This invention pertains to a regulatory system which utilizes components of the Tet 
repressor/operator/inducer system of prokaryotes to regulate gene expression in eukaryotic 
5 cells. In particular, this invention provides transgenic animals carrying a polynucleotide 
sequence encoding a tetracycline-regulated transcriptional inhibitor fusion protein which is 
useful for inhibiting expression, in a highly controlled manner, of a gene linked to one or 
more tet operator sequences. The fusion protein comprises a first polypeptide which binds to 
a tet operator sequence, operatively linked to a heterologous second polypeptide which 
10 inhibits transcription in eukaryotic cells. As used herein, the term "heterologous" used in 
reference to the second polypeptide is intended to indicate that the second polypeptide is 
derived from a different protein than the first polypeptide. 

In one embodiment, the inhibitor fusion protein binds to tet operator sequences in the 
absence but not the presence of tetracycline. In another embodiment, the inhibitor fusion 
1 5 protein binds to tet operator sequences in the presence but not the absence of tetracycline. 
The second polypeptide of the fusion protein comprises a transcriptional inhibitor domain 
(e.g., a "silencer" domain), such as a transcription silencer domain of a protein selected from 
the group consisting of v-erbA, the Drosophila Krueppel protein, the retinoic acid receptor 
alpha, the thyroid hormone receptor alpha, the yeast Ssn6/Tupl protein complex, the 
20 Drosophila protein even-skipped, SIR1 , NePl, the Drosophila dorsal protein, TSF3, SFI, the 
Drosophila hunchback protein, the Drosophila knirps protein, WT1, Oct-2.1, the Drosophila 
engrailed protein, E4BP4 and ZF5. 

A transgenic animal of the invention can further have a second transgene comprising a 
gene of interest operably linked to at least one tet operator sequence. 
25 The transgenic animal of the invention can be, for example, a mouse. In other 

embodiment, the animal is a cow, a goat, a sheep or a pig. 

The transgene(s) of the invention can be integrated randomly or at a predetermined 
location within the genome of the animal. 

The invention further provides a method for modulating transcription of a tet 
30 operator-linked transgene in an animal of the invention, involving administering tetracycline 
or a tetracycline analogue to the animal. 



• # 

Brief Description of the Drawings 

Figure 7 is a bar graph depicting the stimulation of luciferase activity in HR5-C1 1 
cells by tetracycline and different tetracycline analogues (1 ng/ml fx.)- Cells were grown in 
the absence (-) or presence of the indicated tetracyclines for 3 days before luciferase activity 
5 was determined. Each solid and hatched bar represents the luciferase activity of a single 
culture dish. 

Figure 2 is a graph depicting the relative luciferase activity in HR5-C1 1 cells when 
incubated with different concentrations of doxycycline. The results of three independent 
experiments are shown. 

10 Figure 5 is a graph depicting the kinetics of induction of luciferase activity in HR5- 

Cl 1 cells by doxycycline. HR5-C1 1 cultures were exposed to 1 jag/ml of doxycycline and 
luciferase activity was measured after different time intervals; (•) cultures containing 
doxycycline, (o) cultures grown in the absence of antibiotic. 

Figure 4 shows the amino acid sequences of various classes of Tet repressors, 
15 illustrating the homology between the amino acid sequences of different classes of Tet 

repressors, as compared to class B Tet repressors (e.g., TnlO-derived). Amino acid positions 
in other classes of Tet repressors that are identical to class B are indicated by a dash. 

Figure 5 shows the nucleotide sequences of tet operators of different classes: class A 
(SEQ ID NO: 1 1), class B (SEQ ID NO: 12), class C (SEQ ID NO: 13), class D (SEQ ID 
20 NO: 14) and class E (SEQ ID NO: 1 5). 

Figure 6 is a schematic diagram of a bidirectional promoter construct for coordinate 
regulation of two genes of interest operatively linked to the same tet operators for regulation 
by a tetracycline-regulated transcriptional activator. 

Figure 7 A (SEQ ID NO: 6) shows the nucleotide sequence of a bidirectional promoter 
25 region for coordinate regulation of two genes of interest by a tetracycline-regulated 
transcriptional activator. 

Figure 7B (SEQ ID NO: 7) shows the nucleotide sequence of a bidirectional promoter 
region for coordinate regulation of two genes of interest by a tetracycline-regulated 
transcriptional activator. 
30 Figure 8 is two graphs depicting coordinate expression of luciferase and 0- 

galactosidase activity by a tetracycline-regulated transcriptional activator. 

Figure 9A-B are schematic diagrams of self-regulating promoters for expression of 
tetracycline-regulated transcriptional activators (tTA). Panel A illustrates self-regulation of 
expression of a wild-type Tet repressor-containing transactivator fusion protein that binds to 
35 tet operators in the absence of Tc. Panel B illustrates self-regulation of expression of a 

mutated Tet repressor-containing transactivator fusion protein that binds to tet operators in 
the presence of Tc. 

Figure 10 is a schematic diagram of the negative and positive regulation of sTtet 
operator (te/Owt)-linked gene of interest by a tetracycline-regulated transcriptional inhibitor 



protein (tSD) and a tetracycline-inducible transcriptional activator fusion protein (rtTA), 
respectively, in the presence of increasing concentrations of the tetracycline analogue 
doxycycline. 

Figure 11 is a schematic diagram of the construction of TetR-silencer domain fusion 
5 contructs by in-ftame fusion of nucleic acid encoding either a Krueppel or v-erbA silencer 
domain to the 3' end of nucleic acid encoding a Tet repressor (tetR gene). 

Figure 12 is a graphic representation of the expression of luciferase activity in mice 
transgenic for the luciferase reporter gene alone (checked columns at right) or double 
transgenic animals carrying the luciferase reporter gene and a tTA R transgene, either in the 
10 absence of doxycycline (dark columns in middle) or in the presence of doxycycline (light 
columns at left). 

Figure 13 is a graphic representation of luciferase activity in cells cotransfected with a 
tTA4 transactivator construct and a tetOQ^ luciferase reporter gene in the presence or absence 
of doxycycline (first pair of columns), a tTA R 6 transactivator construct and a tetO c $ 

15 luciferase reporter gene in the presence or absence of doxycycline (second pair of columns), 
tTA 4 and tTA R 6 transactivator constructs and tetOC4 luciferase reporter gene in the presence 
or absence of doxycycline (third pair of columns) or tTA 4 and tTA R 6 transactivator 
constructs and tetO C6 luciferase reporter gene in the presence or absence of doxycycline 
(fourth pair of columns). 

20 Figure 14 is a schematic diagram of model regulatory systems for coordinate 

regulation of one gene in a cell using a reverse activator and a silencer (Model A) or 
independent regulation of two genes in a cell using an activator and a reverse activator 
(Model B). 

25 Detailed Inscription of the Invention 

This invention pertains to nucleic acid molecules and proteins which can be used to 
regulate the expression of genes in eukaryotic cells or animals in a highly controlled manner. 
Regulation of gene expression by the system of the invention involves at least two 
components: A gene which is operatively linked to a regulatory sequence and a protein 

30 which, in either the presence or absence of an inducible agent, binds to the regulatory 
sequence and either activates or inhibits transcription of the gene. The system of the 
invention utilizes components of the Tet repressor/operator/inducer system of prokaryotes to 
stimulate gene expression in eukaryotic cells. 

Various aspects of the invention pertain to fusion proteins which are capable of either 

35 activating or inhibiting gene transcription when bound to tet operator (JetO) sequences, but 
which bind to tet operator sequences only in the presence or, alternatively, in the absence of 
tetracycline, or an analogue thereof. Th^s, in a host cell, transcription of a gene operatively 
linked to a tet operator sequence(s) is stimulated or inhibited by a fusion protein of the 
invention by altering the concentration of tetracycline (or analogue) in contact with the host 




cell (e.g., adding or removing tetracycline from a culture medium, or administering or 
ceasing to administer tetracycline to a host organism, etc.). 

The invention further pertains to target transcription units for regulation by the fusion 
protein of the invention. In addition to allowing for regulation of a single tef-operator linked 
gene of interest, the invention also provides novel transcription units containing two or more 
genes to be transcribed that can be regulated in either a coordinate or independent manner by 
a transactivator fusion protein of the invention. Methods for stimulating or inhibiting 
transcription of a gene using tetracycline (or analogues thereof), and kits which contain the 
components of the regulatory system described herein, are also encompassed by the 
invention. 

In the following subsections, the nucleic acids and proteins comprising the 
components of the inducible regulatory system of the invention, and their interrelationship, 
are discussed in greater detail. The subsections are as follows: 

I. Tetracycline-Inducible Transcriptional Activators 

A. The first polypeptide of the transactivator fusion protein 

B. The second polypeptide of the transactivator fusion protein 

C. A third polypeptide of the transactivator fusion protein 

II. Expression of a Transactivator Fusion Protein 

A. Expression vectors 

B. Host cells 

C. Introduction of nucleic acid into host cells 

D. Transgenic Organisms 

E. Homologous Recombinant Organisms 

III. Target Transcription Units Regulated by a Tetracycline-Inducible Transactivator 

A. Regulation of expression of tet operator-linked nucleotide sequences 

B. Coordinate regulation of two nucleotide sequences 

C. Independent regulation of mulitple nucleotide sequences 

D. Combined coordinate and independent regulation of multiple nucleotide 
sequences 

IV. Tetracycline-Regulated Transcriptional Inhibitors 

A. The first polypeptide of the transcritional inhibitor fusion protein 

B. The second polypeptide of the transcritional inhibitor fusion protein 

C. A third polypeptide of the transcritional inhibitor fusion protein 

D. Expression of the transcriptional inhibitor fusion protein 




V. Kits of the Invention 

VI. Regulation of Gene Expression by Tetracycline or Analogues Thereof 

A. Stimulation of gene expression by transactivator fusion proteins 

B. Inhibition of gene expression by transcriptional inhibitor fusion proteins 

C. Combined positive and negative regulation of gene expression 

VII. Applications of the Invention 

A. Gene Therapy 

B. Production of Proteins in Vitro 

C. Production of Proteins in Vivo 

D. Animal Models of Human Disease 

E. Production of Stable Cell Lines for Cloning 

I. Tetracycline-Inducible Transcriptional Activators 

In the inducible regulatory system of the invention, transcription of a gene is activated 
by a transcriptional activator protein, also referred to herein simply as a transactivator. The 
transactivator of the invention is a fusion protein. One aspect of the invention thus pertains to 
fusion proteins and nucleic acids (e.g., DNA) encoding fusion proteins. The term "fusion 
protein" is intended to describe at least two polypeptides, typically from different sources, 
which are operatively linked. With regard to the polypeptides, the term "operatively linked" 
is intended to mean that the two polypeptides are connected in manner such that each 
polypeptide can serve its intended function. Typically, the two polypeptides are covalently 
attached through peptide bonds. The fusion protein is preferably produced by standard 
recombinant DNA techniques. For example, a DNA molecule encoding the first polypeptide 
is ligated to another DNA molecule encoding the second polypeptide, and the resultant hybrid 
DNA molecule is expressed in a host cell to produce the fusion protein. The DNA molecules 
are ligated to each other in a 5 f to 3' orientation such that, after ligation, the translational 
frame of the encoded polypeptides is not altered (i.e., the DNA molecules are ligated to each 
other in-frame). 

A. The first polypeptide of the transactivator fusion protein 

The transactivator fusion protein of the invention is composed, in part, of a first 
polypeptide which binds to a tet operator sequence in the presence of tetracycline (Tc), or an 
analogue thereof. The first polypeptide of the fusion protein is preferably a mutated Tet 
repressor. The term "mutated Tet repressor" is intended to include polypeptides having an 
amino acid sequence which is similar to a wild-type Tet repressor but which has at least one 
amino acid difference from the wild-type Tet repressor. The term "wild-type Tet repressor" 
is intended to describe a protein occurring in nature which represses transcription from tet 




operator sequences in prokaryotic cells in the absence of Tc. The amino acid difference(s) 
between a mutated Tet repressor and a wild-type Tet repressor may be substitution of one or 
more amino acids, deletion of one or more amino acids or addition of one or more amino 
acids. The mutated Tet repressor of the invention has the following functional properties: 1) 
5 the polypeptide can bind to a tet operator sequence, i.e., it retains the DNA binding 
specificity of a wild-type Tet repressor; and 2) it is regulated in a reverse manner by 
tetracycline than a wild-type Tet repressor, i.e., the mutated Tet repressor binds to a tet 
operator sequence only the presence of Tc (or Tc analogue) rather than in the absence of Tc. 
In a preferred embodiment, a mutated Tet repressor having the functional properties 

10 described above is created by substitution of amino acid residues in the sequence of a wild- 
type Tet repressor. For example, as described in Example 1, a TnlO-derived Tet repressor 
having amino acid substitutions at amino acid positions 71, 95, 101 and 102 has the desired 
functional properties and thus can be used as the first polypeptide in the transactivator fusion 
protein of the invention. The amino acid sequence of this mutated Tet repressor is shown in 

15 SEQ ID NO: 2 (positions 1-207). In one embodiment of the mutated Tet repressor, position 
71 is mutated from glutamic acid to lysine, position 95 is mutated from aspartic acid to 
asparagine, position 101 is mutated from leucine to serine and position 102 is mutated from 
glycine to aspartic acid, although the invention is not limited to these particular mutations. 
Mutation of fewer than all four of these amino acid positions may be sufficient to acheive a 

20 Tet repressor with the desired functional properties. Accordingly, a Tet repressor is 
preferably mutated at at least one of these positions. Other amino acid substitutions, 
deletions or additions at these or other amino acid positions which retain the desired 
functional properties of the mutated Tet repressor are within the scope of the invention. The 
crystal structure of a Tet repressor-tetracycline complex, as described in Hinrichs, W. et al. 

25 (1994) Science 264:418-420, can be used for rational design of mutated Tet repressors. 

Based upon this structure, amino acid position 71 is located outside the tetracyline binding 
pocket, suggesting mutation at this site may not be necessary to achieve the desired 
functional properties of a mutated Tet repressor of the invention. In contrast, amino acid 
positions 95, 101 and 102 are located within the conserved tetracyline binding pocket. Thus, 

30 the tetracycline binding pocket of a Tet repressor may be targeted for mutation to create a 
mutated Tet repressor of the invention. 

Additional mutated Tet repressors for incorporation into a fusion protein of the 
invention can be created according to the teachings of the invention. A number of different 
classes of Tet repressors have been described, e.g., A, B, C, D and E (of which theTnlO- 

35 encoded repressor is a class B repressor). The amino acid sequences of the different classes 
of Tet repressors share a high degree of homology (i.e., 40-60 % across the length of the 
proteins), including in the region encompassing the above-described mutations. The amino 
acid sequences of various classes of Tet repressors are shown and compared in Figure 4, and 
are also described in Tovar, K. et al. (1988) Mol Gen. Genet. 215:76-80. Accordingly, 



equivalent mutations to those described above for the TnlO-derived Tet repressor can be 
made in other classes of Tet repressors for inclusion in a fusion protein of the invention. For 
example, amino acid position 95, which is an aspartic acid in all five repressor classes, can be 
mutated to asparagine in any class of repressor. Similarly, position 102, which is glycine in 
5 all five repressor classes, can be mutated to aspartic acid in any class of repressor. 

Additional suitable equivalent mutations will be apparent to those skilled in the art and can be 
created and tested for functionality by procedures described herein. Nucleotide and amino 
acid sequences of Tet repressors of the A, C, D and E classes are disclosed in Waters, S.H. et 
al. (1983) Nucl Acids Res 11:6089-6105, Unger, B. et al. (1984) Gene 11: 103-108, Unger, 

10 B. et al. (1984) Nucl Acids Res. 12:7693-7703 and Tovar, K. et al. (1988) Mol Gen. Genet 
215:76-80, respectively. These wild-type sequences can be mutated according to the 
teachings of the invention for use in the inducible regulatory system described herein. 

Alternative to the above-described mutations, additional suitable mutated Tet 
repressors (i.e., having the desired functional properties described above) can be created by 

1 5 mutagenesis of a wild type Tet repressor and selection as described in Example 1 . The 
nucleotide and amino acid sequences of wild-type class B Tet repressors are disclosed in 
Hillen, W. and Schollmeier, K. (1983) Nucl Acids Res. 1 1:525-539 and Postle, K. et al. 
(1984) Nucl. Acids Res. 12:4849-4863. The nucleotide and amino acid sequences of wild- 
type class A, C, D and E type repressors are cited above. A mutated Tet repressor can be 

20 created and selected, for example as follows: a nucleic acid (e.g., DNA) encoding a wild-type 
Tet repressor is subjected to random mutagenesis and the resultant mutated nucleic acids are 
incorporated into an expression vector and introduced into a host cell for screening. A 
screening assay is used which allows for selection of a Tet repressor which binds to a tet 
operator sequence only in the presence of tetracycline. For example, a library of mutated 

25 nucleic acids in an expression vector can be introduced into an E. coli strain in which tet 
operator sequences control the expression of a gene encoding a Lac repressor and the Lac 
repressor controls the expression of a gene encoding an selectable marker (e.g., drug 
resistance). Binding of a Tet repressor to tet operator sequences in the bacteria will inhibit 
expression of the Lac repressor, thereby inducing expression of the selectable marker gene. 

30 Cells expressing the marker gene are selected based upon the selectable phenotype (e.g., drug 
resistance). For wild-type Tet repressors, expression of the selectable marker gene will occur 
in the absence of Tc. A nucleic acid encoding a mutated Tet repressor is selected using this 
system based upon the ability of the nucleic acid to induce expression of the selectable 
marker gene in the bacteria only in the presence of Tc. 

35 A first polypeptide of the transactivator fusion protein (e.g., the mutated Tet 

repressor) has the property of binding specifically to a tet operator sequence. Each class of 
Tet repressor has a corresponding target tet operator sequence. Accordingly, the term "tet 
operator sequence" is intended to encompass all classes of tet operator sequences, e.g. class 
A, B, C, D, and E. Nucleotide sequences of these five classes of tet operators are shown in 




Figure 5 and SEQ ID NOs: 1 1-15, and are described in Waters, S.H. et al. (1983) cited supra, 
Hillen, W. and Schollenmeier, K. (1983) cited supra, Stiiber, D. and Bujard, H. (1981) Proc. 
Natl Acad Sci. USA 22:167-171, Unger, B. et al. (1984) cited supra and Tovar, K. et al. 
(1988) cited supra. In a preferred embodiment, the mutated Tet repressor is a TnlO-encoded 
5 repressor (i.e., class B) and the tet operator sequence is a class B tet operator sequence. 
Alternatively, a mutated class A Tet repressor can be used with a class A tet operator 
sequence, and so on for the other classes of Tet repressor/operators. 

Another approach for creating a mutated Tet repressor which binds to a class A tet 
operator is to further mutate the already mutated TnlO-derived Tet repressor described herein 

10 (a class B repressor) such that it no longer binds efficiently to a class B type operator but 

instead binds efficiently to a class A type operator. It has been found that nucleotide position 
6 of class A or B type operators is the critical nucleotide for recognition of the operator by its 
complimentary repressor (position 6 is a G/C pair in class B operators and an A/T pair in 
class A operators) (see Wissman et al. (1988) J. Mol Biol 202:397-406). It has also been 

1 5 found that amino acid position 40 of a class A or class B Tet repressor is the critical amino 
acid residue for recognition of position 6 of the operator (amino acid position 40 is a 
threonine in class B repressors but is an alanine in class A repressors). It still further has 
been found that substitution of Thr40 of a class B repressor with Ala alters its binding 
specificity such that the repressor can now bind a class A operator (similarly, substitution of 

20 Ala40 of a class A repressor with Thr alters its binding specificity such that the repressor can 
now bind a class B operator) (see Altschmied et al. (1988) EMBOJ. 2:401 1-4017). 
Accordingly, one can alter the binding specificity of the mutated TnlO-derived Tet repressor 
disclosed herein by additionally changing amino acid residue 40 from Thr to Ala by standard 
molecular biology techniques (e.g., site directed mutagenesis). 

25 A mutated Tet repressor having specific mutations (e.g., at positions 71, 95, 101 

and/or 102, as described above) can be created by introducing nucleotide changes into a 
nucleic acid encoding a wild-type repressor by standard molecular biology techniques, e.g. 
site directed mutagenesis or PCR-mediated mutagenesis using oligonucleotide primers 
incorporating the nucleotide mutations. Alternatively, when a mutated Tet repressor is 

30 identified by selection from a library, the mutated nucleic acid can be recovered from the 
library vector. To create a transactivator fusion protein of the invention, a nucleic acid 
encoding a mutated Tet repressor is then ligated in-frame to another nucleic acid encoding a 
transcriptional activation domain and the fusion construct is incorporated into a recombinant 
expression vector. The transactivator fusion protein can be expressed by introducing the 

35 recombinant expression vector into a host cell or animal. 



B. The second polypeptide of the transactivator fusion protein 

The first polypeptide of the transactivator fusion protein is operatively linkecTto a 
second polypeptide which directly or indirectly activates transcription in eukaryotic cells. To 




operatively link the first and second polypeptides, typically nucleotide sequences encoding 
the first and second polypeptides are ligated to each other in-frame to create a chimeric gene 
encoding a fusion protein, although the first and second polypeptides can be operatively 
linked by other means that preserve the function of each polypeptide (e.g., chemically 
5 crosslinked). In a preferred embodiment, the second polypeptide of the transact vator itself 
possesses transcriptional activation activity (i.e., the second polypeptide directly activates 
transcription). In another embodiment, the second polypeptide activates transcription by an 
indirect mechanims, through recruitment of a transcriptional activation protein to interact 
with the fusion protein. Accordingly, the term M a polypeptide which activates transcription in 

10 eukaryotic cells" as used herein is intended to include polypeptides which either directly or 
indirectly activates transcription. 

Polypeptides which can function to activate transcription in eukaryotic cells are well 
known in the art. In particular, transcriptional activation domains of many DNA binding 
proteins have been described and have been shown to retain their activation function when 

1 5 the domain is transferred to a heterologous protein. A preferred polypeptide for use in the 

fusion protein of the invention is the herpes simplex virus virion protein 1 6 (referred to herein 
as VP16, the amino acid sequence of which is disclosed in Triezenberg, S.J. et al. (1988) 
Genes Dev. 2:718-729). In one embodiment, about 127 of the C-terminal amino acids of 
VP 16 are used. For example, a polypeptide having an amino acid sequence shown in SEQ ID 

20 NO: 2 (positions 208-335) can be used as the second polypeptide in the fusion protein. In 

another embodiment, at least one copy of about 1 1 amino acids from the C-terminal region of 
VP 16 which retain transcriptional activation ability is used as the second polypeptide. 
Preferably, a dimer of this region (i.e., about 22 amino acids) is used. Suitable C-terminal 
peptide portions of VP16 are described in Seipel, K. et al. (EMBOJ. (1992) 13:4961-4968). 

25 For example, a dimer of a peptide having an amino acid sequence shown in SEQ ID NO: 4 
(encoded by a nucleotide sequence shown in SEQ ID NO: 3) can be used as the second 
polypeptide in the fusion protein. 

Other polypeptides with transcriptional activation ability in eukaryotic cells can be 
used in the fusion protein of the invention. Transcriptional activation domains found within 

30 various proteins have been grouped into categories based upon similar structural features. 
Types of transcriptional activation domains include acidic transcription activation domains, 
proline-rich transcription activation domains, serine/threonine-rich transcription activation 
domains and glutamine-rich transcription activation domains. Examples of acidic 
transcriptional activation domains include the VP 16 regions already described and amino 

35 acid residues 753-881 of GAL4. Examples of proline-rich activation domains include amino 
acid residues 399-499 of CTF/NF1 and amino acid residues 31-76 of AP2. Examples of 
serine/threonine-rich transcription activation domains include amino acid residues 1-427 of 
ITF1 and amino acid residues 2-451 of ITF2. Examples of glutamine-rich activatioif domains 
include amino acid residues 175-269 of Oct 1 and amino acid residues 132-243 of Spl. The 




amino acid sequences of each of the above described regions, and of other useful 
transcriptional activation domains, are disclosed in Seipel, K. et al. {EMBOJ. (1992) 
11:4961-4968). 

In addition to previously described transcriptional activation domains, novel 
5 transcriptional activation domains, which can be identified by standard techniques, are within 
the scope of the invention. The transcriptional activation ability of a polypeptide can be 
assayed by linking the polypeptide to another polypeptide having DNA binding activity and 
determining the amount of transcription of a target sequence that is stimulated by the fusion 
protein. For example, a standard assay used in the art utilizes a fusion protein of a putative 

10 transcriptional activation domain and a GAL4 DNA binding domain (e.g., amino acid 

residues 1-93). This fusion protein is then used to stimulate expression of a reporter gene 
linked to GAL4 binding sites (see e.g., Seipel, K. et al. (1992) EMBOJ. 11:4961-4968 and 
references cited therein). 

In another embodiment, the second polypeptide of the fusion protein indirectly 

15 activates transcription by recruiting a transcriptional activator to interact with the fusion 

protein. For example, a mutated tetR of the invention can be fused to a polypeptide domain 
(e.g., a dimerization domain) capable of mediating a protein-protein interaction with a 
transcriptional activator protein, such as an endogenous activator present in a host cell. It has 
been demonstrated that functional associations between DNA binding domains and 

20 transactivation domains need not be covalent (see e.g., Fields and Song (1989) Nature 
240:245-247; Chien et al. (1991) Proa Natl. Acad. Sci. USA ££9578-9582; Gyuris et al. 
(1993) Cell 25:791-803; andZervos, A.S. (1993) Cell 22:223-232). Accordingly, the second 
polypeptide of the fusion protein may not directly activate transcription but rather may form a 
stable interaction with an endogenous polypeptide bearing a compatible protein-protein 

25 interaction domain and transactivation domain. Examples of suitable interaction (or 

dimerization) domains include leucine zippers (Landschulz et al. (1989) Science 242:1681- 
1688), helix-loop-helix domains (Murre, C. et al. (1989) Cell 5£:537-544) and zinc finger 
domains (Frankel, A.D. et al. (1988) Science 240:70-73). Interaction of a dimerization 
domain present in the fusion protein with an endogeneous nuclear factor results in 

30 recruitment of the transactivation domain of the nuclear factor to the fusion protein, and 
thereby to a tet operator sequence to which the fusion protein is bound. 

C A third polypeptide of the transactivator fusion protein 

In addition to a mutated Tet repressor and a transcriptional activation domain, a fusion 
35 protein of the invention can contain an operatively linked third polypeptide which promotes 
transport of the fusion protein to a cell nucleus. Amino acid sequences which, when included 
in a protein, function to promote ^ansport of the protein to the nucleus are known in the art 
and are termed nuclear localization signals (NLS). Nuclear localization signals typically are 
composed of a stretch of basic amino acids. When attached to a heterologous protein (e.g., a 




fusion protein of the invention), the nuclear localization signal promotes transport of the 
protein to a cell nucleus. The nuclear localization signal is attached to a heterologous protein 
such that it is exposed on the protein surface and does not interfere with the function of the 
protein. Preferably, the NLS is attached to one end of the protein, e.g. the N-terminus. The 
5 amino acid sequence of a non-limiting exampleof an NLS that can be included in a fusion 
protein of the invention is shown in SEQ ID NO: 5. Preferably, a nucleic acid encoding the 
nuclear localization signal is spliced by standard recombinant DNA techniques in-frame to 
the nucleic acid encoding the fusion protein (e.g., at the 5* end). 

10 The plasmid pUHDl 7-1 (described in further detail in Example 1), which comprises a 

transactivator of the invention having the nucleotide sequence shown in SEQ ID NO: 1, has 
been deposited on July 8, 1994 under the provisions of the Budapest Treaty at the Deutsche 
Sammlung Von Mikroorganismen und ZellKulturen GmbH (DSM) in Braunschweig, 
Germany and assigned deposit number DSM 9279. 
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lh Expression of a Transactivator Fusion Protein 

A. Expression Vectors 

A nucleic acid of the invention encoding a transactivator fusion protein, as described 

20 above, can be incorporated into a recombinant expression vector in a form suitable for 

expression of the fusion protein in a host cell. The term "in a form suitable for expression of 
the fusion protein in a host cell" is intended to mean that the recombinant expression vector 
includes one or more regulatory sequences operatively linked to the nucleic acid encoding the 
fusion protein in a manner which allows for transcription of the nucleic acid into mRNA and 

25 translation of the mRNA into the fusion protein. The term "regulatory sequence" is art- 
recognized and intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are known to those 
skilled in the art and are described in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990). It should be understood that the 

30 design of the expression vector may depend on such factors as the choice of the host cell to 
be transfected and/or the amount of fusion protein to be expressed. 

When used in mammalian cells, a recombinant expression vector's control functions 
are often provided by viral genetic material. For example, commonly used promoters are 
derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40. Use of viral 

35 regulatory elements to direct expression of the fusion protein can allow for high level 
constitutive expression of the fusion protein in a variety of host cells. In a preferred 
recombinant expression vector the sequences encoding the fusion protein are flanked 
upstream (i.e., 5 1 ) by the human cytomegalovirus IE promoter and downstream (i.e., 3') by an 
SV40 poly(A) signal. For example, an expression vector similar to that described in Example 




1 can be used. The human cytomegalovirus IE promoter is described in Boshart et al. (1985) 
Cell 11:521-530. Other ubiquitously expressing promoters which can be used include the 
HSV-Tk promoter (disclosed in McKnight et al. (1984) Cell 22:253-262) and p-actin 
promoters (e.g., the human P-actin promoter as described by Ng et al. (1985) Mol. Cell Biol 
5 1:2720-2732). 

Alternatively, the regulatory sequences of the recombinant expression vector can 
direct expression of the fusion protein preferentially in a particular cell type, i.e., tissue- 
specific regulatory elements can be used. Non-limiting examples of tissue-specific promoters 
which can be used include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes 

10 Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv. Immunol 
41:235-275), in particular promoters of T cell receptors (Winoto and Baltimore (1989) 
EMBOJ. £:729-733) and immunoglobulins (Banerji et al. (1983) Cell 22:729-740; Queen 
and Baltimore (1983) Cell 21:74 1-748), neuron-specific promoters (e.g., the neurofilament 
promoter; Byrne and Ruddle (1989) Proc. Natl Acad. ScL USA 56:5473-5477), pancreas- 

15 specific promoters (Edlund et al. (1985) Science 220:912-916), and mammary gland-specific 
promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316 and European Application 
Publication No. 264,166). Developmentally-regulated promoters are also encompassed, for 
example the murine hox promoters (Kessel and Gruss (1990) Science 242:374-379) and the 
ct-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 2:537-546). 

20 Alternatively, a self-regulating construct encoding a transactivator fusion protein can 

be created. To accomplish this, nucleic acid encoding the fusion protein is operatively linked 
to a minimal promoter sequence and at least one tet operator sequence. For example, the 
nucleic acid of SEQ ID NO: 1 can be linked to a promoter having a nucleotide sequence 
shown in SEQ ID NO: 8, 9 or 10 (the nucleic acids of SEQ ID NOs: 8 and 9 comprise a 

25 minimal CMV promoter and ten tet operators; the nucleic acids of SEQ ID NO: 10 comprises 
a TK promoter and ten tet operators). A schematic diagram of such a self-regulating 
construct is shown in Figure 9B. When this nucleic acid is introduced into a cell (e.g., in a 
recombinant expression vector), a small amount of basal transcription of the transactivator 
gene is likely to occur due to "leakiness". In the presence of Tc (or analogue thereof) this 

30 small amount of the transactivator fusion protein will bind to the tet operator sequence(s) 
upstream of the nucleotide sequence encoding the transactivator and stimulate additional 
transcription of the nucleotide sequence encoding the transactivator, thereby leading to 
further production of the transactivator fusion protein in the cell. It will be appreciated by 
those skilled in the art that such a self-regulating promoter can also be used in conjunction 

35 with other tetracycline-regulated transactivators, such as the wild-type Tet repressor fusion 
protein (tTA) described in Gossen, M. and Bujard, H. (1992) Proc. Natl Acad. ScL USA 
£2:5547-5551, which binds to tet operators in the absence of Tc (as illustrated in Figure 9A). 
When used in conjunction with this transactivator, self-regulated transcription of the" 
nucleotide sequence encoding this transactivator is stimulated in the absence of Tc. The 




plasmid pUHD15-3, which comprises nucleotide sequences encoding the tTA described in 
Gossen and Bujard (1992), cited supra, operatively linked to a self-regulating promoter, has 
been deposited on July 8, 1994 under the provisions of the Budapest Treaty at the Deutsche 
Sammlung Von Mikroorganismen und ZellKulturen GmbH 0SM) in Braunschweig, 
5 Germany and assigned deposit number DSM 9280. 

In one embodiment, the recombinant expression vector of the invention is a plasmid, 
such as that described in Example 1. Alternatively, a recombinant expression vector of the 
invention can be a virus, or portion thereof, which allows for expression of a nucleic acid 
introduced into the viral nucleic acid. For example, replication defective retroviruses, 

10 adenoviruses and adeno-associated viruses can be used. Protocols for producing recombinant 
retroviruses and for infecting cells in vitro or in vivo with such viruses can be found in 
Current Protocols in Molecular Biology . Ausubel, F.M. et al. (eds.) Greene Publishing 
Associates, (1989), Sections 9.10-9.14 and other standard laboratory manuals. Examples of 
suitable retroviruses include pLJ, pZIP, pWE and pEM which are well known to those skilled 

1 5 in the art. Examples of suitable packaging virus lines include \|/Crip, \j/Cre, \j/2 and xj/Am. 
The genome of adenovirus can be manipulated such that it encodes and expresses a 
transactivator fusion protein but is inactivated in terms of its ability to replicate in a normal 
lytic viral life cycle. See for example Berkner et al. (1988) BioTechniques 6:616; Rosenfeld 
et al. (1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155. Suitable 

20 adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other strains of 
adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art. 
Alternatively, an adeno-associated virus vector such as that described in Tratschin et al. 
(1985) Mol Cell Biol. 5:3251-3260 can be used to express a transactivator fusion protein. 

25 B. Host Cells 

A fusion protein of the invention is expressed in a eukaryotic cell by introducing 
nucleic acid encoding the fusion protein into a host cell, wherein the nucleic acid is in a form 
suitable for expression of the fusion protein in the host cell. For example, a recomM&ant 
expression vector of the invention, encoding the fusion protein, is introduced into a fitost cell. 
30 Alternatively, nucleic acid encoding the fusion protein which is operatively linked to 

regulatory sequences (e.g., promoter sequences) but without additional vector sequences can 
be introduced into a host cell. As used herein, the term "host cell" is intended to include any 
eukaryotic cell or cell line so long as the cell or cell line is not incompatible with the protein 
to be expressed, the selection system chosen or the fermentation system employed. Non- 
35 limiting examples of mammalian cell lines which can be used include CHO dhfr" cells 

(Urlaub and Chasin (1980) Proc. Natl Acad. Sci. USA 21:42 16-4220), 293 cells (Graham et 
al. (1977) J. Gen. Virol 2£: pp59) or myeloma cells like SP2 or NS0 (Galfre and Milstein 
(1981) Meth Enzymol 22£B):3-46). 
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In addition to cell lines, the invention is applicable to normal cells, such as cells to be 
modified for gene therapy purposes or embryonic cells modified to create a transgenic or 
homologous recombinant animal. Examples of cell types of particular interest for gene 
therapy purposes include hematopoietic stem cells, myoblasts, hepatocytes, lymphocytes, 
5 neuronal cells and skin epithelium and airway epithelium. Additionally, for transgenic or 
homologous recombinant animals, embryonic stem cells and fertilized oocytes can be 
modified to contain nucleic acid encoding a transactivator fusion protein. Moreover, plant 
cells can be modified to create transgenic plants. 

The invention is broadly applicable and encompasses non-mammalian eukaryotic 

10 cells as well, including insect (e.g,. Sp. frugiperda), yeast (e.g., S. cerevisiae, S. pombe, P. 

pastoris, K. lactis, H. polymorpha; as generally reviewed by Fleer, R. (1992) Current Opinion 
in Biotechnology l{5}:486-496)), fungal and plant cells. Examples of vectors for expression 
in yeast S. cerivisae include pYepSecl (Baldari. el aL, (1987) Embo J, 6:229-234), pMFa 
(Kurjan and Herskowitz, (1982) Cell 311:933-943), pJRY88 (Schultz el aL, (1987) Gene 

15 54:1 13-123), and pYES2 (Invitrogen Corporation, San Diego, CA). The fusion protein can be 
expressed in insect cells using baculovirus expression vectors (e.g., as described in O'Reilly 
et al. (1992) Baculovirus Expression Vectors: A Laboratory Manual, Stockton Press). 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF 9 
cells) include the pAc series (Smith et aL, (1983) Mol Cell Biol 2:2156-2165) and the pVL 

20 series (Lucklow, V.A., and Summers, M.D., (1989) Virology 12Q:31-39). 

C. Introduction of Nucleic Acid into a Host Cell 

Nucleic acid encoding the fusion protein can be introduced into a host cell by standard 
techniques for transfecting eukaryotic cells. The term "transfecting" or "transfection" is 

25 intended to encompass all conventional techniques for introducing nucleic acid into host 
cells, including calcium phosphate co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, electroporation and microinjection. Suitable methods for transfecting host cells 
can be found in Sambrook et al* {Molecular Cloning: A Laboratory Manual, 2nd Edition, 
Cold Spring Harbor Laboratory press (1989)), and other laboratory textbooks. 

30 The number of host cells transformed with a nucleic acid of the invention will depend, 

at least in part, upon the type of recombinant expression vector used and the type of 
transfection technique used. Nucleic acid can be introduced into a host cell transiently, or 
more typically, for long term regulation of gene expression, the nucleic acid is stably 
integrated into the genome of the host cell or remains as a stable episome in the host cell. 

35 Plasmid vectors introduced into mammalian cells are typically integrated into host cell DNA 
at only a low frequency. In order to identify these integrants, a gene that contains a selectable 
marker (e.g., drug resistance) is generally introduced into the host cells along with the nucleic 
acid of interest. Preferred selectable markers include those which confer resistance to certain 
drugs, such as G418 and hygromycin. Selectable markers can be introduced on a separate 



plasmid from the nucleic acid of interest or, are introduced on the same plasmid. Host cells 
transfected with a nucleic acid of the invention (e.g., a recombinant expression vector) and a 
gene for a selectable marker can be identified by selecting for cells using the selectable 
marker. For example, if the selectable marker encodes a gene conferring neomycin 
5 resistance, host cells which have taken up nucleic acid can be selected with G41 8. Cells that 
have incorporated the selectable marker gene will survive, while the other cells die. 

A host cell transfected with a nucleic acid encoding a fusion protein of the invention 
can be further transfected with one or more nucleic acids which serve as the target for the 
fusion protein. The target nucleic acid comprises a nucleotide sequence to be transcribed 
10 operatively linked to at least one tet operator sequence (described in more detail in Section III 
below). 

Nucleic acid encoding the fusion protein of the invention can be introduced into 
eukaryotic cells growing in culture in vitro by conventional transfection techniques (e.g., 
calcium phosphate precipitation, DEAE-dextran transfection, electroporation etc.). Nucleic 

15 acid can also be transferred into cells in vivo, for example by application of a delivery 
mechanism suitable for introduction of nucleic acid into cells in vivo, such as retroviral 
vectors (see e.g., Ferry, N et al. (1991) Proc. Natl Acad. Sci. USA ££l8377-8381; and Kay, 
M.A. et al. (1992) Human Gene Therapy 3:641-647), adenoviral vectors (see e.g., Rosenfeld, 
M.A. (1992) Ce//6£:143-155; and Herz, J. and Gerard, R.D. (1993) Proc. Natl. Acad. Sci. 

20 USA 20:2812-2816), receptor-mediated DNA uptake (see e.g., Wu, G. and Wu, C.H. (1988) 
J. Biol. Chem. 263:14621; Wilson et al. (1992) J. Biol Chem. 262:963-967; and U.S. Patent 
No. 5,166,320), direct injection of DNA (see e.g., Acsadi et al. (1991) Nature 222l 815-818; 
and Wolff et al. (1990) Science 242:1465-1468) or particle bombardment (see e.g., Cheng, L. 
et al. (1993) Proc. Natl Acad. Sci. USA 20:4455-4459; and Zelenin, A.V. et al. (1993) FEBS 

25 Letters H£:29-32). Thus, for gene therapy purposes, cells can be modified in vitro and 
administered to a subject or, alternatively, cells can be directly modified in vivo. 

D. Transgenic Organisms 

Nucleic acid a transactivator fusion protein can transferred into a fertilized oocyte of a 

30 non-human animal to create a transgenic animal which expresses the fusion protein of the 
invention in one or more cell types. A transgenic animal is an animal having cells that 
contain a transgene, wherein the transgene was introduced into the animal or an ancestor of 
the animal at a prenatal, e.g., an embryonic, stage. A transgene is a DNA which is integrated 
into the genome of a cell from which a transgenic animal develops and which remains in the 

35 genome of the mature animal, thereby directing the expression of an encoded gene product in 
one or more cell types or tissues of the transgenic animal. In one embodiment, the non- 
human animal is a mouse, although the invention is not limited thereto. In other 
embodiments, the transgenic animal is a goat, sheep, pig, cow or other domestic farm animal. 



Such transgenic animals are useful for large scale production of proteins (so called "gene 
pharming"). 

A transgenic animal can be created, for example, by introducing a nucleic acid 
encoding the fusion protein (typically linked to appropriate regulatory elements, such as a 
5 constitutive or tissue-specific enhancer) into the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, and allowing the oocyte to develop in a pseudopregnant female foster animal. 
Intronic sequences and polyadenylation signals can also be included in the transgene to 
increase the efficiency of expression of the transgene. Methods for generating transgenic 
animals, particularly animals such as mice, have become conventional in the art and are 

10 described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009 and Hogan, B. et al., 
(1986) A Laboratory Manual, Cold Spring Harbor, New York, Cold Spring Harbor 
Laboratory. A transgenic founder animal can be used to breed additional animals carrying 
the transgene. Transgenic animals carrying a transgene encoding the fusion protein of the 
invention can further be bred to other transgenic animals carrying other transgenes, e.g., to a 

1 5 transgenic animal which contains a gene operatively linked to a tet operator sequence 
(discussed in more detail in Section III below). 

It will be appreciated that, in addition to transgenic animals, the regulatory system 
described herein can be applied to other transgenic organisms, such as transgenic plants. 
Transgenic plants can be made by conventional techniques known in the art. Accordingly, 

20 the invention encompasses non-human transgenic organisms, including animals and plants, 
that contains cells which express the transactivator fusion protein of the invention (i.e., a 
nucleic acid encoding the transactivator is incorporated into one or more chromosomes in 
cells of the transgenic organism). 

25 E. Homologous Recombinant Organisms 

The invention also provides a homologous recombinant non-human organism 
expressing the fusion protein of the invention. The term "homologous recombinant 
organism" as used herein is intended to describe an organism, e.g. animal or plant, containing 
a gene which has been modified by homologous recombination between the gene and a DNA 

30 molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal. In one 
embodiment, the non-human animal is a mouse, although the invention is not limited thereto. 
An animal can be created in which nucleic acid encoding the fusion protein has been 
introduced into a specific site of the genome, i.e., the nucleic acid has homologously 
recombined with an endogenous gene. 

35 To create such a homologous recombinant animal, a vector is prepared which contains 

DNA encoding the fusion protein flanked at its 5 ! and 3* ends by additional nucleic acid of a 
eukaryotic gene at which homologous recombination is to occur. The additional nucleic acid 
flanking that encoding the fusion protein is of sufficient length for successful homologous 
recombination with the eukaryotic gene. Typically, several kilobases of flanking DNA (both 




at the 5* and 3 f ends) are included in the vector (see e.g., Thomas, K.R. and Capecchi, M. R. 
(1987) Cell 11:503 for a description of homologous recombination vectors). The vector is 
introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the 
introduced DNA has homologously recombined with the endogenous DNA are selected (see 
5 e.g., Li, E. et al. (1992) Cell 62:915). The selected cells are then injected into a blastocyst of 
an animal (e.g., a mouse) to form aggregation chimeras (see e.g., Bradley, A. in 
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E.J. Robertson, ed. 
(IRL, Oxford, 1987) pp. 1 13-152). A chimeric embryo can then be implanted into a suitable 
pseudopregnant female foster animal and the embryo brought to term. Progeny harbouring 

10 the homologously recombined DNA in their germ cells can be used to breed animals in which 
all cells of the animal contain the homologously recombined DNA. These "germline 
transmission" animals can further be mated to animals carrying a gene operatively linked to 
at least one tet operator sequence (discussed in more detail in Section HI below). 

In addition to the homologous recombination approaches described above, enzyme- 

1 5 assisted site-specific integration systems are known in the art and can be applied to the 
components of the regulatory system of the invention to integrate a DNA molecule at a 
predetermined location in a second target DNA molecule. Examples of such enzyme-assisted 
integration systems include the Cre recombinase-Iox target system (e.g., as described in 
Baubonis, W. and Sauer, B. (1993) Nucl Acids Res. 21:2025-2029; and Fukushige, S. and 

20 Sauer, B. (1992) Proc. Natl Acad. Set USA 52:7905-7909) and the FLP recombinase-FRT 
target system (e.g., as described in Dang, D.T. and Perrimon, N. (1992) Dev. Genet. 12:367- 
375; and Fiering, S. et al. (1993) Proc. Natl Acad. Sci. USA 20:8469-8473). 

III. Target T ranscription Units Regulated by a Tetracycline-Inducible Transactivator 
25 A fusion protein of the invention is used to regulate the transcription of a target 

nucleotide sequence. This target nucleotide sequence is operatively linked to a regulatory 
sequence to which the fusion protein binds. More specifically, the fusion protein regulates 
expression of a nucleotide sequence operatively linked to at least one tet operator sequence. 
Accordingly, another aspect of the invention relates to target nucleic acids (e.g., DNA 
30 molecules) comprising a nucleotide sequence to be transcribed operatively linked to at least 
one tet operator sequence. Such nucleic acids are also referred to herein as fe/-regulated 
transcription units (or simply transcription units). 

Within a transcription unit, the "nucleotide sequence to be transcribed" typically 
includes a minimal promoter sequence which is not itself transcribed but which serves (at 
35 least in part) to position the transcriptional machinery for transcription. The minimal 
promoter sequence is linked to the transcribed sequence in a 5 f to 3' direction by 
phosphodiester bonds (i.e., the promoter is located upstream of the transcribed sequence) to 
form a contiguous nucleotide sequence. Accordingly, as used herein, the terms "nucleotide 
sequence to be transcribed" or "target nucleotide sequence" are intended to include both the 
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nucleotide sequence which is transcribed into mRNA and an operatively linked upstream 
minimal promoter sequence. The term "minimal promoter" is intended to describe a partial 
promoter sequence which defines the start site of transcription for the linked sequence to be 
transcribed but which by itself is not capable of initiating transcription efficiently, if at all. 
5 Thus, the activity of such a minimal promoter is dependent upon the binding of a 

transcriptional activator (such as the tetracycline-inducible fusion protein of the invention) to 
an operatively linked regulatory sequence (such as one or more tet operator sequences). In 
one embodiment, the minimal promoter is from the human cytomegalovirus (as described in 
Boshart et al. (1985) Cell 41:52 1-530). Preferably, nucleotide positions between about +75 

10 to -53 and +75 to -3 1 are used. Other suitable minimal promoters are known in the art or can 
be identified by standard techniques. For example, a functional promoter which activates 
transcription of a contiguously linked reporter gene (e.g., chloramphenicol acetyl transferase, 
p-galactosidase or luciferase) can be progressively deleted until it no longer activates 
expression of the reporter gene alone but rather requires the presence of an additional 

1 5 regulatory sequence(s). 

Within a transcription unit, the target nucleotide sequence (including the transcribed 
nucleotide sequence and its upstream minimal promoter sequence) is operatively linked to at 
least one tet operator sequence. In a typical configuration, the tet operator sequence(s) is 
operatively linked upstream (i.e., 5 f ) of the minimal promoter sequence through a 

20 phosphodiester bond at a suitable distance to allow for transcription of the target nucleotide 
sequence upon binding of a regulatory protein (e.g., the transactivator fusion protein) to the 
tet operator sequence. That is, the transcription unit is comprised of, in a 5* to 3' direction: tet 
operator sequence(s) - a minimal promoter - a transcribed nucleotide sequence. It will be 
appreciated by those skilled in the art that there is some flexibility in the permissable distance 

25 between the tet operator sequence(s) and the minimal promoter, although typically the tet 

operator sequences will be located within about 200-400 base pairs upstream of the minimal 
promoter. 

The nucleotide sequences of examples of tef-regulated promoters, containing tet 
operator sequences linked to a minimal promoter, that can be used in the invention are shown 

30 in SEQ ID NO: 8-10. The nucleotide sequences of SEQ ID NOs: 8 and 9 comprise a 

cytomegalovirus minimal promoter linked to ten tet operator sequences; the two nucleotide 
sequences differ in the distance between the operators and the first transcribed nucleotide. 
The nucleotide sequence of SEQ ID NO: 10 comprises a herpes simplex virus minimal tk 
promoter linked to ten tet operator sequences. The promoter of SEQ ID NO: 8 corresponds 

35 to PhCMV*" 1 * described in Gossen, M. and Bujard, H. (1992) Proc. Natl Acad. Sci. USA 

22:5547-5551. The promoter of SEQ ID NO: 9 corresponds to PhCMV*" 2 > als ° described in 
Gossen, M. and Bujard, H, cited supra. 

Alternatively, since regulatory elements have been observed in the art to function 
downstream of sequences to be transcribed, it is likely that the tet operator sequence(s) can be 



operativeiy linked downstream (i.e., 3') of the transcribed nucleotide sequence. Thus, in this 
configuration, the transcription unit is comprised of, in a 5 f to 3' direction: a minimal 
promoter - a transcribed nucleotide sequence - tet operator sequence(s). Again, it will be 
appreciated that there is likely to be some flexibility in the permissable distance downstream 
5 at which the tet operator sequence(s) can be linked. 

The term "tet operator sequence" is intended to encompass all classes of tet operators 
(e.g., A, B, C, D and E). A nucleotide sequence to be transcribed can be operativeiy linked to 
a single tet operator sequence, or for an enhanced range of regulation, it can be operativeiy 
linked to multiple tet operator sequences (e.g., two, three, four, five, six, seven, eight, nine, 
10 ten or more operator sequences). In a preferred embodiment, the sequence to be transcribed 
is operativeiy linked to seven tet operator sequences. 

A te/-regulated transcription unit can further be incorporated into a recombinant 
vector (e.g., a plasmid or viral vector) by standard recombinant DNA techniques. The 
transcription unit, or recombinant vector in which it is contained, can be introduced into a 
15 host cell by standard transfection techniques, such as those described above. It should be 
appreciated that, after introduction of the transcription unit into a population of host cells, it 
may be necessary to select a host cell clone which exhibit low basal expression of the tet 
operator-linked nucleotide sequence (i.e., selection for a host cell in which the transcription 
unit has integrated at a site that results in low basal expression of the tet operator-linked 
20 nucleotide sequence). Furthermore, a fe/-regulated transcription unit can be introduced, by 
procedures described above, into the genome of a non-human animal at an embryonic stage 
or into plant cells to create a transgenic or homologous recombinant organims carrying the 
transcription unit in some or all of its cells. Again, it should be appreciated that it may be 
necessary to select a transgenic or homologous organism in which there is low basal 
25 expression of the the tet operator-linked nucleotide sequence in cells of interest. 

In one embodiment, the target nucleotide sequence of the ^/-regulated transcription 
unit encodes a protein of interest. Thus, upon induction of transcription of the nucleotide 
sequence by the transactivator of the invention and translation of the resultant mRNA, the 
protein of interest is produced in a host cell or animal. Alternatively, the nucleotide sequence 
30 to be transcribed can encode for an active RNA molecule, e.g., an antisense RNA molecule or 
ribozyme. Expression of active RNA molecules in a host cell or animal can be used to 
regulate functions within the host (e.g., prevent the production of a protein of interest by 
inhibiting translation of the mRNA encoding the protein). 

A transactivator of the invention can be used to regulate transcription of an exogenous 
35 nucleotide sequence introduced into the host cell or animal. An "exogenous" nucleotide 
sequence is a nucleotide sequence which is introduced into the host cell and typically is 
inserted into the genome of tlu host. The exogenous nucleotide sequence may not be present 
elsewhere in the genome of the host (e.g., a foreign nucleotide sequence) or may be an 
additional copy of a sequence which is present within the genome of the host but which is 




integrated at a different site in the genome. An exogenous nucleotide sequence to be 
transcribed and an operatively linked tet operator sequence(s) can be contained within a 
single nucleic acid molecule which is introduced into the host cell or animal. 

Alternatively, a transactivator of the invention can be used to regulate transcription of 
5 an endogenous nucleotide sequence to which a tet operator sequence(s) has been linked. An 
"endogenous" nucleotide sequence is a nucleotide sequence which is present within the 
genome of the host. An endogenous gene can be operatively linked to a tet operator 
sequence(s) by homologous recombination between a te/O-containing recombination vector 
and sequences of the endogeneous gene. For example, a homologous recombination vector 
10 can be prepared which includes at least one tet operator sequence and a miminal promoter 
sequence flanked at its 3' end by sequences representing the coding region of the endogenous 
gene and flanked at its 5' end by sequences from the upstream region of the endogenous gene 
by excluding the actual promoter region of the endogenous gene. The flanking sequences are 
of sufficient length for successful homologous recombination of the vector DNA with the 
15 endogenous gene. Preferably, several kilobases of flanking DNA are included in the 

homologous recombination vector. Upon homologous recombination between the vector 
DNA and the endogenous gene in a host cell, a region of the endogenous promoter is replaced 
by the vector DNA containing one or more tet operator sequences operably linked to a 
minimal promoter. Thus, expression of the endogenous gene is no longer under the control 
20 of its endogenous promoter but rather is placed under the control of the tet operator 
sequence(s) and the minimal promoter. 

In another embodiment, tet operator sequences can be inserted elsewhere within an 
endogenous gene, preferably within a 5'or 3' regulatory region, via homologous 
recombination to create an endogenous gene whose expression can be regulated by a 
25 tetracycline-regulated fusion protein described herein. For example, one or more tetO 

sequences can be inserted into a promoter or enhancer region of an endogenous gene such 
that promoter or enhancer function is maintained (i.e., the tetO sequences are introduced into 
a site of the promoter/enhancer region that is not critical for promoter/enhancer function). 
Regions within promoters or enhancers which can be altered without loss of 
30 promoter/enhancer function are known in the art for many genes or can be determined by 
standard techniques for analyzing critical regulatory regions. An endogenous gene having 
tetO sequences inserted into a non-critical regulatory region will retain the ability to be 
expressed in its normal constitutive and/or tissue-specific manner but, additionally, can be 
downregulated by a tetracycline-controlled transcriptional inhibitor protein in a controlled 
35 manner. For example, constitutive expression of such a modified endogenous gene can be 

inhibited by in the presence of tetracycline (or analogue) using an inhibitor fusion protein that 
binds to tetO sequences in the presence of tetracycline (or analogue) (as described in further 
detail in Section IV and Section VI, Part B, below). 
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A. Regulation of Expression of tet Operator-Linked Nucleotide Sequences 

Expression of a tet operator-linked nucleotide sequences is regulated by a 
transactivator fusion protein of the invention. Thus, the fusion protein and the target nucleic 
acid are both present in a host cell or organism. The presence of both the transactivator 
5 fusion protein and the target transcription unit in the same host cell or organism can be 

achieved in a number of different ways. For example, a host cell can be transfected with one 
nucleic acid of the expression system (e.g., encoding the transactivator fusion protein), stably 
transfected cells can be selected and then the transfected cells can be re-transfected (also 
referred to as "supertransfected") with nucleic acid corresponding to the other nucleic acid of 

10 the expression system (e.g., the target nucleic acid to be transcribed). Two distinct selectable 
markers can be used for selection, e.g., uptake of the first nucleic acid can be selected with 
G418 and uptake of the second nucleic acid can be selected with hygromycin. Alternatively, 
a single population of cells can be transfected with nucleic acid corresponding to both 
components of the system. Accordingly, the invention provides a nucleic acid composition 

15 comprising: 

• a first nucleic acid encoding a fusion protein which activates transcription, the fusion 
protein comprising a first polypeptide which binds to a tet operator sequence in the 
presence of tetracycline or a tetracycline analogue operatively linked to a second 

20 polypeptide which activates transcription in eukaryotic cells; and 

• a second nucleic acid comprising a nucleotide sequence to be transcribed operatively 



25 In one embodiment, the two nucleic acids are two separate molecules (e.g., two different 
vectors). In this case, a host cell is cotransfected with the two nucleic acid molecules or 
successively transfected first with one nucleic acid molecule and then the other nucleic acid 
molecule. In another embodiment, the two nucleic acids are linked (i.e., colinear) in the same 
molecule (e.g., a single vector). In this case, a host cell is transfected with the single nucleic 

30 acid molecule. 

The host cell may be a cell cultured in vitro or a cell present in vivo (e.g., a cell 
targeted for gene therapy). The host cell can further be a fertilized ooctye, embryonic stem 
cell or any other embryonic cell used in the creation of non-human transgenic or homologous 
recombinant animals. Transgenic or homologous recombinant animals which comprise both 

35 nucleic acid components of the expression system can be created by introducing both nucleic 
acids into the same cells at an embryonic stage, or more preferably, an animal which carries 
one nucleic acid component of the system in its genome is mated to an animal which carries 
the other nucleic acid component of the system in its genome. Offspring which have 
inherited both nucleic acid components can then be identified by standard techniques. 



linked to at least one tet operator sequence. 
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B. Coordinate Regulation of Expression of Two Nucleotide Sequences 

In addition to providing a system for the regulated expression of a single transcribed 
nucleotide sequence, the invention further permits coordinated regulation of the expression of 
two nucleotide sequences operatively linked to the same tet operator sequence(s). 
Accordingly, another aspect of the invention pertains to a novel te/-regulated transcription 
unit for coordinate regulation of two genes. In this transcription unit, the same tet operator 
sequence(s) regulates the expression of two operatively linked nucleotide sequences that are 
transcribed in opposite directions from the common tet operator sequence(s). Accordingly, 
one nucleotide sequence is operatively linked to one side of the tet operator sequence (e.g., 
the 5' end on the top strand of DNA) and the other nucleotide sequence is operatively linked 
to the opposite side of the tet operator sequence (e.g., the 3' end on the top strand of DNA). 
Additionally, it should be understood that each nucleotide sequence to be transcribed includes 
an operatively linked minimal promoter sequence which is located between the nucleotide 
sequence to be transcribed and the tet operator sequence(s). 

A representative example of such a transcription unit is diagrammed schematically in 
Figure 6. In this vectors, the two nucleotide sequences, operatively linked to the same tet 
operator sequence(s), are transcribed in opposite directions relative to the tet operator 
sequence(s) (i.e., the sequences are transcribed in a divergent manner upon activation by a 
transactivator fusion protein of the invention). By "transcribed in opposite directions relative 
to the tet operator sequence(s)", it is meant that the first nucleotide sequence is transcribed 5' 
to 3' from one strand of the DNA (e.g., the bottom strand) and the second nucleotide 
sequence is transcribed 5" to 3* from the other stand of the DNA (e.g., the top strand), 
resulting in bidirectional transcription away from the tet operator sequence(s). 

Accordingly, the invention provides a recombinant vector for coordinately-regulated, 
bidirectional transcription of two nucleotide sequence. In one embodiment, the vector 
comprises a nucleotide sequence linked by phosphodiester bonds comprising, in a 5 1 to 3' 
direction: 

• a first nucleotide sequence to be transcribed, operatively linked to 

• at least one tet operator sequence, operatively linked to 

• a second nucleotide sequence to be transcribed, 

wherein transcription of the first and second nucleotide sequences proceeds in opposite 
directions from the at least one tet operator sequence(s) (i.e., the first and second nucleotide 
sequences are transcribed in a divergent manner). 

In another embodiment, the vector does not include the first and second nucleotide 
sequence to be transcribed but instead contains cloning sites which allow for the introduction 




into the vector of nucleotide sequences of interest. Accordingly, in this embodiment, the 
vector comprises a nucleotide sequence comprising in a 5 f to 3 f direction: 

• a first cloning site for introduction of a first nucleotide sequence to be 
5 transcribed, operatively linked to 

• at least one tet operator sequence, operatively linked to 

• a second cloning site for introduction of a second nucleotide sequence to be 
transcribed, 

10 wherein transcription of a first and second nucleotide sequence introduced into the vector 
proceeds in opposite directions from the at least one tet operator sequence(s). It will be 
appreciated by those skilled in the art that this type of "cloning vector" may be in a form 
which also includes minimal promoter sequences such that a first nucleotide sequence 
introduced into the first cloning site is operatively linked to a first minimal promoter and a 

1 5 second nucleotide sequence introduced into the second cloning site is operatively linked to a 
second minimal promoter. Alternatively, the "cloning vector" may be in a form which does 
not include minimal promoter sequences and instead, nucleotide sequences including linked 
minimal promoter sequences are introduced into the cloning sites of the vector. 

The term "cloning site" is intended to encompass at least one restriction endonuclease 

20 site. Typically, multiple different restriction endonuclease sites (e.g., a polylinker) are 
contained within the nucleic acid. 

In yet another embodiment, the vector for coordinate, bidirectional transcription of 
two nucleotide sequences may contain a first nucleotide to be transcribed, such as that 
encoding a detectable marker (e.g., luciferase or p-galactosidase), and a cloning site for 

25 introduction of a second nucleotide sequence of interest. 

The nucleotide sequences of two different suitable bidirectional promoter regions for 
use in a vector for coordinate regulation of two nucleotide sequences to be transcribed, as 
described herein, are shown in Figures 7 A and 7B (SEQ ID NOS: 6 and 7, respectively). In 
the construct of Figure 7A, both minimal promoters present in the construct are derived from 

30 a CMV promoter. In the construct of Figure 7B, one minimal promoter present in the 

construct is derived from a CMV promoter, whereas the second minimal promoter is derived 
from a TK promoter. A plasmid pUHDG1316-8, comprising a bidirectional promoter of the 
invention, has been deposited on July 8, 1994 under the provisions of the Budapest Treaty at 
the Deutsche Sammlung Von Mikroorganismen und ZellKulturen GmbH (DSM) in 

35 Braunschweig, Germany and assigned deposit number DSM 928 1 . 

The transcription unit of the invention for bidirectional transcription of two nucleotide 
sequences operatively linked to the same tet operator sequence(s) is useful for coordinating 
the expression of the two nucleotide sequences of interest. Preferably, at least one of the 
nucleotide sequences to be transcribed is a eukaryotic nucleotide sequence. In one 




application, the vector is used to produce stoichiometric amounts of two subunits of a 
heterodimeric molecule in the same cell. For example, the vector can be used produce 
antibody heavy and light chains in the same cell or to produce growth factor receptor subunits 
in the same cells. In another application, the vector is used to express two gene products that 
cooperate in establishing a particular cellular phenotype. In yet another application, the 
vector is used to coexpress an indicator function and a gene of interest, wherein the indicator 
is utilized to monitor expression of the gene of interest. Thus, one of the two coordinately 
expressed sequences can encode a gene of interest and the other can encode a detectable 
marker, such as a surface marker or enzyme (e.g., P-galactosidase or luciferase) which is used 
for selection of cells expressing the gene of interest. 

Transcription of the two coordinately-regulated nucleotide sequences can be induced 
by tetracycline (or an analogue thereof) by use of the Tc-inducible transcriptional activator of 
the invention to regulate expression of the two nucleotide sequences. Thus, in this system, 
expression of both nucleotide sequences is "off 1 in the absence of Tc (or analogue), whereas 
expression is turned "on" by the presence of Tc (or analogue). Alternatively, the vector for 
coordinate regulation of two nucleotide sequences can be used in conjunction with other 
tetracycline-regulated transcription factors known in the art. For example, a transactivator 
fusion protein of a wild-type Tet repressor fused to a transcriptional activation domain, which 
activates gene expression in the absence of Tc (or analogue), such as the tTA described in 
Gossen, M. and Bujard, H. (1992) Proc. Natl Acad. Sci. USA £2:5547-5551, can also be used 
in conjuction with this target transcription unit for coordinate regulation. 

C. Independent Regulation of Expression of Multiple Nucleotide Sequences 

The invention still further permits independent and opposite regulation of two or 
more nucleotide sequences to be transcribed. Accordingly, another aspect of the invention 
pertains to a novel ^/-regulated transcription unit for independent regulation of two or more 
genes. To independently regulate the expression of two nucleotide sequences to be 
transcribed, one nucleotide sequence is operatively linked to a tet operator sequence(s) of one 
class type and the other nucleotide sequence is operatively linked to a tet operator 
sequence(s) of another class type. Accordingly, the invention provides at least one 
recombinant vector for independent regulation of transcription of two nucleotide sequences. 
In one embodiment, the vector(s) comprises: 

• a first nucleotide sequence to be a transcribed operatively linked to at least one 
tet operator sequence of a first class type; and 

• a second nucleotide sequence to be a transcribed operatively linked to at least 
one tet operator sequence of a second class type. 




(It should be understood that each nucleotide sequence to be transcribed also includes an 
operatively linked, upstream minimal promoter sequence.) The two independently regulated 
transcription units can be included on a single vector, or alternatively, on two seperate 
vectors. The recombinant vector(s) containing the nucleotide sequences to be transcribed can 
be introduced into a host cell or animal as described previously. 

In another embodiment, the vector(s) does not include the first and second nucleotide 
sequence to be transcribed but instead contains cloning sites which allow for the introduction 
into the vector of nucleotide sequences of interest. Accordingly, in this embodiment, the 
vector(s) comprises: 

• a first cloning site for introduction of a first nucleotide sequence to be 
transcribed operatively linked to at least one tet operator sequence of a first class type; 
and 

• a second cloning site for introduction of a second nucleotide sequence to be 
transcribed operatively linked to at least one tet operator sequence of a second class type. 

This cloning vector(s) may be in a form that already includes first and second minimal 
promoters operatively linked, respectively, to the first and second cloning sites. 
Alternatively, nucleotide sequences to be transcribed which include an operatively linked 
minimal promoter can be introduced into the cloning vector. 

In yet another embodiment, the vector for independent regulation of two nucleotide 
sequences may contain a first nucleotide to be transcribed, such as that encoding a detectable 
marker or a suicide gene, operatively linked to at least one tet operator sequence of a first 
class type and a cloning site for introduction of a second nucleotide sequence of interest such 
that it is operatively linked to at least one tet operator sequence of a second class type. 

It will be appreciated by those skilled in the art that various combinations of classes of 
tet operator sequences can be used for independent regulation of two nucleotide sequences. 
For example, the first tet operator sequence(s) can be of the class A type and the second can 
be of the class B type, or the first tet operator sequence can be of the class B type and the 
second can be of the class C type, etc. Preferably, one to the two tet operators used is a class 
B type operator. 

Independent transcription of the first and second nucleotide sequences is regulated in 
a host cell by further introducing into the host cell one or more nucleic acids encoding two 
different transactivator fusion proteins which bind independently to tet operator sequences of 
different class types. The first fusion protein comprises a polypeptide which binds to a tet 
operator sequence in the presence of tetracycline or a tetracycline analogue, operatively 
linked to a polypeptide which activates transcription in eukaryotic cells (e.g., a transactivator 
fusion protein of the invention, such as a mutated TnlO-derived Tet repressor linked to a 
VP 16 activation region). The second fusion protein comprises a polypeptide which binds to 
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a tet operator sequence in the absence of tetracycline or a tetracycline analogue, operatively 
linked to a polypeptide which activates transcription in eukaryotic cells (e.g., a wild-type 
TnlO-derived Tet repressor linked to a VP 16 activation region, such as the tTA described in 
Gossen, M. and Bujard, H. (1992) Proc. Natl Acad. ScL USA 22:5547-5551). In one 
5 embodiment, the first fusion protein binds to the tet operator sequence of the first class type 
used in the transcription unit and the second fusion protein binds to the tet operator sequence 
of the second class type used in the transcription unit. Alternatively, in another embodiment, 
the first fusion protein binds to the second class type of tet operator and the second fusion 
protein binds to the first class type of tet operator. 

10 For example, the first nucleotide sequence to be transcribed may be linked to a class 

A tet operator and the first fusion protein may bind to class A operators, whereas the second 
nucleotide sequence to be transcribed may be linked to a class B tet operator and the second 
fusion protein may bind to class B operators. Thus, in this embodiment, transcription of the 
first nucleotide sequence is activated in the presence of Tc (or analogue thereof) while 

15 transcription of the second nucleotide sequence is activated in the absence of Tc (or analogue 
thereof). Alternatively, in another embodiment, the first fusion protein binds to class B 
operators and the second fusion protein binds to class A operators. In this case, transcription 
of the second nucleotide sequence is activated in the presence of Tc (or analogue thereof) 
while transcription of the first nucleotide sequence is activated in the absence of Tc (or 

20 analogue thereof). Appropriate transactivator proteins for use in this system can be designed 
as described above in Section I and in Gossen and Bujard (1992) cited supra. In order to 
inhibit heterodimerization between the two different types of Tet repressor fusion proteins 
present in the same cell, it may be necessary to mutate the dimerization region of one or both 
of the transactivator fusion proteins. Mutations can be targeted to the C-terminal region of 

25 TetR known to be involved in dimerization. The dimerization region has been described in 
detail based upon the crystal structure of TetR (see Hinrichs, W. et al. (1994) Science 
264:4 18-420). 

This system allows for independent and opposite regulation of the expression of two 
genes by Tc and analogues thereof. Use of different Tc analogues as inducing agents may 

30 further allow for high, low or intermediate levels of expression of the different sequences 

(discussed in greater detail in Section V below). The novel transcription unit of the invention 
for independently regulating the expression of two genes, described above, can be used in 
situations where two gene products are to be expressed in the same cell but where it is 
desirable to express one gene product while expression of the other gene product is turned 

35 "off, and vice versa. For example, this system is particularly useful for expressing in the 
same host cell either a therapeutic gene or a suicide gene (i.e., a gene which encodes a 
product that can be used to destroy the cell, such as ricin or herpes simplex virus thymidine 
kinase). In many gene therapy situations, it is desirable to be able to express a gene for 
therapeutic purposes in a host cell but also to have the capacity to destroy the host cell once 




the therapy is completed. This can be accomplished using the above-described system by 
linking the therapeutic gene to one class of tet operator and the suicide gene to another class 
of tet operator. Thus, expression of the therapeutic gene in a host cell can be stimulated by 
Tc (in which case expression of the suicide gene is absent). Then, once the therapy is 
5 complete, Tc is removed, which turns off expression of the therapeutic gene and turns on 
expression of the suicide gene in the cell. 

D. Combined Coordinate and Independent Regulation of Multiple Nucleotide Sequences 
It is further possible to regulate the expression of four nucleotide sequences by 
10 combining the system described in Section IIIB with the system described in Section IIIC 
such that two pairs of sequences are coordinately regulated while one pair is independently 
regulated from the other pair. Accordingly, two target transcription units can be designed 
comprising: 

15 • a first nucleic acid comprising in a 5* to 3 1 direction: a first nucleotide sequence to be 
transcribed, a tet operator sequence(s) of a first class type, and a second nucleotide sequence 
to be transcribed 

• a second nucleic acid comprising in a 5 f to 3' direction: a third nucleotide sequence to 
20 be transcribed, a tet operator sequence(s) of a second class type, and a fourth nucleotide 
sequence to be transcribed. 

Transcription of the first and second nucleotide sequences in the first nucleic acid proceeds in 
a divergent manner from the first class of tet operator sequence(s). Likewise, transcription of 

25 the third and fourth nucleotide sequences in the second nucleic acide proceeds in a divergent 
manner from the second class of tet operator sequence(s). Thus, expression of the first and 
second nucleotide sequences is coordinately regulated and expression of the third and fourth 
nucleotide sequences is coordinately regulated. However, expression of the first and second 
sequences is independently (and oppositely) regulated compared to the third and fourth 

30 sequences through the use of two different transactivator fusion proteins, as described above, 
one which activates transcription in the presence of Tc (or analogue thereof) and the other 
which activates transcription in the absence of Tc (or analogue thereof). One transactivator is 
designed to bind to a tet operators of the first class type and the other is designed to bind to a 
tet operators of the second class type. In other embodiments, rather than already containing 

35 first, second, third and/or fourth nucleotide sequences to be transcribed, these transcription 
units can contain cloning sites which allow for the introduction of first, second, third and/or 
fourth nucleotide sequences to be transcribed. 




TV. Tetracvcline-Regnlated Transcriptional Inhibitors 

Another aspect of the invention pertains to transcriptional inhibitor fusion proteins. 
The inhibitor fusion proteins of the invention are constructed similarly to the transactivator 
fusion proteins of the invention (see Section I above) but instead of containing a polypeptide 
domain that stimulates transcription in eukaryotic cells, the inhibitor fusion proteins contain a 
polypeptide domain that inhibits transcription in eukaryotic cells. The inhibitor fusion 
proteins are used to downregulate the expression of genes operably linked to tetO sequences. 
For example, when a te/O-linked gene is introduced into a host cell or animal, the level of 
basal, constitutive expression of the gene may vary depending upon the type of cell or tissue 
in which the gene is introduced and on the site of integration of the gene. Alternatively, 
constitutive expression of endogenous genes into which tetO sequences have been introduced 
may vary depending upon the strength of additional endogenous regulatory sequences in the 
vicinity. The inhibitor fusion proteins described herein provide compositions that can be 
used to inhibit the expression of such te/O-linked genes in a controlled manner. 

In one embodiment, the inhibitor fusion protein of the invention comprises a first 
polypeptide that binds to tet operator sequences in the absence, but not the presence, of 
tetracycline (Tc) or an analogue thereof operatively linked to a heterologous second 
polypeptide that inhibits transcription in eukaryotic cells. In another embodiment, the 
inhibitor fusion protein comprises a first polypeptide that binds to tet operator sequences in 
the presence, but not the absence, of tetracycline operatively linked to a heterologous second 
polypeptide that inhibits transcription in eukaryotic cells. The term "heterologous" is 
intended to mean that the second polypeptide is derived from a different protein than the first 
polypeptide. Like the transactivator fusion proteins, the transcriptional inhibitor fusion 
proteins can be prepared using standard recombinant DNA techniques as described herein. 

A. The first polypeptide of the transcriptional inhibitor fusion protein 

The transcriptional inhibitor fusion protein of the invention is composed, in part, of a 
first polypeptide which binds to a tet operator sequence either (i) in the absence, but not the 
presence of tetracycline (Tc), or an analogue thereof, or alternatively, (ii) in the presence, but 
not the absence of Tc or an analogue thereof. 

Preferably, in the former embodiment, the first polypeptide is a wild-type Tet 
repressor (which binds to tet operator sequences in the absence but not the presence of Tc). 
A wild-type Tet repressor of any class (e.g., A, B, C, D or E) may be used as the first 
polypeptide. Preferably, the wild-type Tet repressor is a TnlO-derived Tet repressor. The 
nucleotide and amino acid sequences of a wild-type TnlO-derived Tet repressor are shown in 
SEQ ID NO: 16 and SEQ ID NO: 17, respectively. 

Alternatively, in the latter embodiment, the first polypeptide is a mutated Tet 
repressor as described in Section I, part A above (which binds to tet operator sequences in the 
presence but not the absence of Tc). A mutated Tet repressor of any class (e.g., A, B, C, D or 




E) may be used as the first polypeptide. Preferably, the mutated Tet repressor is a TnlO- 
derived Tet repressor having one or more amino acid substitutions at positions 71 , 95, 101 
and/or 102. The nucleotide and amino acid sequences of such a mutated TnlO-derived Tet 
repressor are shown in SEQ ID NO: 18 and SEQ ID NO: 19, respectively. 

5 

B. The second polypeptide of the transcriptional inhibitor fusion protein 

The first polypeptide of the transcriptional inhibitor fusion protein is operatively 
linked to a second polypeptide which directly or indirectly inhibits transcription in eukaryotic 
cells. As described in Section I, above, to operatively link the first and second polypeptides 

10 of a fusion protein, typically nucleotide sequences encoding the first and second polypeptides 
are ligated to each other in-frame to create a chimeric gene encoding the fusion protein. 
However, the first and second polypeptides can be operatively linked by other means that 
preserve the function of each polypeptide (e.g., chemically crosslinked). Although the fusion 
proteins are typically described herein as having the first polypeptide at the amino-terminal 

1 5 end of the fusion protein and the second polypeptide at the carboxy-terminal end of the fusion 
protein, it will be appreciated by those skilled in the art that the opposite orientation (i.e., the 
second polypeptide at the amino-terminal end and the first polypeptide at the carboxy- 
terminal end) is also contemplated by the invention. 

Proteins and polypeptide domains within proteins which can function to inhibit 

20 transcription in eukaryotic cells have been described in the art (for reviews see, e.g., 

Renkawitz, R. (1990) Trends in Genetics £: 192-197; and Herschbach, B.M. and Johnson, 
A.D. (1993) Annu. Rev. Cell. Biol. 2:479-509). Such transcriptional inhibitor domains have 
been referred to in the art as "silencing domains" or "repressor domains." Although the 
precise mechanism by which many of these polypeptide domains inhibit transcription is not 

25 known (and the invention is not intended to be limited by mechanism), there are several 
possible means by which repressor domains may inhibit transcription, including: 1) 
competitive inhibition of binding of either activator proteins or the general transcriptional 
machinery, 2) prevention of the activity of a DNA bound activator and 3) negative 
interference with the assembly of a functional preinitiation complex of the general 

30 transcription machinery. Thus, a repressor domain may have a direct inhibitory effect on the 
transcriptional machinery or may inhibit transcription indirectly by inhibiting the activity of 
activator proteins. Accordingly, the term "a polypeptide that inhibits transcription in 
eukaryotic cells" as used herein is intended to include polypeptides which act either directly 
or indirectly to inhibit transcription. As used herein, "inhibition" of transcription is intended 

35 to mean a diminution in the level or amount of transcription of a target gene compared to the 
level or amount of transcription prior to regulation by the transcriptional inhibitor protein. 
Transcriptional inhibition may be partial or complete. The terms "silencer", "repressor" and 
"inhibitor" are used interchangeably herein to describe a regulatory protein, or domains 
thereof, that can inhibit transcription. 




A transcriptional "repressor" or "silencer" domain as described herein is a polypeptide 
domain that retains its transcriptional repressor function when the domain is transferred to a 
heterologous protein. Proteins which have been demonstrated to have repressor domains that 
can function when transferred to a heterologous protein include the v-erbA oncogene product 
(Baniahmad, A. et al. (1992) EMBOJ. 11:1015-1023), the thyroid hormone receptor 
(Baniahmad, supra), the retinoic acid receptor (Baniahmad, supra), and the Drosophila 
Krueppel (Kr) protein (Licht, J.D. et al. (1990) Nature 246:76-79; Sauer, F. and Jackie, H. 
(1991) Nature 151:563-566; Licht, J.D. et al. (1994) Mol Cell Biol 14:4057-4066). Non- 
limiting examples of other proteins which have transcriptional repressor activity in eukaryotic 
cells include the Drosophila homeodomain protein even-skipped (eve), the S. cerevisiae 
Ssn6/Tupl protein complex (see Herschbach and Johnson, supra), the yeast SIR1 protein (see 
Chien, et al. (1993) Cell 21:53 1-541), NePl (see Kohne, et al. (1993) J Mol Biol 212:747- 
755), the Drosophila dorsal protein (see Kirov, et al. (1994) Mol Cell Biol 14:713-722; 
Jiang, et al. (1993) EMBOJ. 12:3201-3209), TSF3 (see Chen, et al. (1993) Mol Cell Biol 
75:831-840), SF1 (see Targa, et al. (1992) Biochem. Biophys. Res. Comm. i£8:4 16-423), the 
Drosophila hunchback protein (see Zhang, et al. (1992) Proc. Natl Acad. Sci. USA &2:751 1- 
7515), the Drosophila knirps protein (see Gerwin, et al. (1994) Mol Cell Biol 14:7899- 
7908), the WT1 protein (Wilm's tumor gene product) (see Anant, et al. (1994) Oncogene 
2:31 13-3126; Madden et al., (1993) Oncogene &1713-1720), Oct-2.1 (see Lillycrop, et al. 
(1994) Mol Cell Biol 14:7633-7642), the Drosophila engrailed protein (see Badiani, et al. 
(1994) Genes Dev. 5:770-782; Han and Manley, (1993) EMBOJ. 12:2723-2733), E4BP4 (see 
Cowell and Hurst, (1994) Nucleic Acids Res. 22:59-65) and ZF5 (see Numoto, et al. (1993) 
Nucleic Acids Res. 21:3767-3775), 

In a preferred embodiment, the second polypeptide of the transcriptional inhibitor 
fusion protein of the invention is a transcriptional silencer domain of the Drosophila 
Krueppel protein. A C-terminal region having repressor activity can be used, such as amino 
acids 403-466 of the native protein (see Sauer, F. and Jackie, H., supra). This region is 
referred to as C64KR. The nucleotide and amino acid sequences of C64KR are shown in 
SEQ ID NO: 20 and SEQ ID NO: 21, respectively. Construction of an expression vector 
encoding a TetR-C64KR fusion protein is described in Example 4. Alternatively, an alanine- 
rich amino terminal region of Kr that also has repressor activity can be used as the second 
polypeptide of the fusion protein. For example, amino acids 26-1 10 of Kr (see Licht, J.D. et 
al., (1990) supra) can be used as the second polypeptide. Alternatively, shorter or longer 
polypeptide fragments encompassing either of the Kr silencer domains that still retain full or 
partial inhibitor activity are also contemplated (e.g., amino acids 62 to 92 of the N-terminal 
silencer domain; see Licht, et al. (1994) supra). 

In another preferred embodiment, the second polypeptide of the transcriptional 
inhibitor fusion protein of the invention is a transcriptional silencer domain of the v-erbA 
oncogene product. The silencer domain of v-erbA has been mapped to approximately amino 




acid residues 362-632 of the native v-erbA oncogene product (see Baniahmad, et al. supra). 
Accordingly, a fragment encompassing this region is used as the second polypeptide of the 
silencer domain. In one embodiment, amino acid residues 364-635 of the native v-erbA 
protein are used. The nucleotide and amino acid sequences of this region of v-erbA are 
shown in SEQ ID NO: 22 and SEQ ID NO: 23, respectively. Construction of an expression 
vector encoding a TetR-v-erbA fusion protein is described in Example 5. Alternatively, 
shorter or longer polypeptide fragments encompassing the v-erbA silencer region that still 
retain full or partial inhibitor activity are also contemplated. For example, a.a. residues 346- 
639, 362-639, 346-632, 346-616 and 362-616 of v-erbA may be used. Additionally, 
polypeptide fragments encompassing these regions that have internal deletions yet still retain 
full or partial inhibitor activity are encompassed by the invention, such as a.a. residues 362- 
468/508-639 of v-erbA. Furthermore, two or more copies of the silencer domain may be 
included in the fusion protein, such as two copies of a.a. residues 362-616 of v-erbA. 
Suitable silencer polypeptide domains of v-erbA are described further in Baniahmad, A. et al. 
(supra). 

In other embodiments, other silencer domains are used. Non-limiting examples of 
polypeptide domains that can be used include: amino acid residues 120-410 of the thyroid 
hormone receptor alpha (THRot), amino acid residues 143-403 of the retinoic acid receptor 
alpha (RARct), amino acid residues 186-232 of knirps, the N-terminal region of WT 1 (see 
Anant, supra), the N-terminal region of Oct-2.1 (see Lillycrop, supra), a 65 amino acid 
domain of E4BP4 (see Cowell and Hurst, supra) and the N-terminal zinc finger domain of 
ZF5 (see Numoto, supra). Moreover, shorter or longer polypeptide fragments encompassing 
these regions that still retain full or partial inhibitor activity are also contemplated. 

In addition to previously described transcriptional inhibitor domains, novel 
transcriptional inhibitor domains, which can be identified by standard techniques, are within 
the scope of the invention. The transcriptional inhibitor ability of a polypeptide can be 
assayed by: 1) constructing an expression vector that encodes the test silencer polypeptide 
linked to another polypeptide having DNA binding activity (i.e., constructing a DNA binding 
domain-silencer domain fusion protein), 2) cotransfecting this expression vector into host 
cells together with a reporter gene construct that is normally constitutively expressed in the 
host cell and also contains binding sites for the DNA binding domain and 3) determining the 
amount of transcription of the reporter gene construct that is inhibited by expression of the 
fusion protein in the host cell. For example, a standard assay used in the art utilizes a fusion 
protein of a GAL4 DNA binding domain (e.g., amino acid residues 1-147) and a test silencer 
domain. This fusion protein is then used to inhibit expression of a reporter gene construct 
that contains positive regulatory sequences (that normally stimulate constitutive transcription) 
and GAL4 binding sites (see e.g., Baniahmad, supra). 




C. A third polypeptide of the transcriptional inhibitor fusion protein 

In addition to a Tet repressor and a transcriptional silencer domain, a transcriptional 
inhibitor fusion protein of the invention can contain an operatively linked third polypeptide 
which promotes transport of the fusion protein to a cell nucleus. As described for the 
transactivator fusion proteins (see Section I, Part C, above), a nuclear localization signal can 
be incorporated into the transcriptional inhibitor fusion protein. 

D. Expression of the transcriptional inhibitor fusion protein 

A nucleic acid molecule encoding a transcriptional inhibitor fusion protein of the 
invention can be incorporated into a recombinant expression vector and introduced into a host 
cell to express the fusion protein in the host cell as described in Section II, Parts A, B and C, 
above. Preferably, a host cell expressing a transcriptional inhibitor fusion protein of the 
invention also carries a tet operator-linked gene of interest (i.e., target nucleotide sequence to 
be transcribed). 

Transgenic organisms expressing a transcriptional inhibitor fusion protein in cells 
thereof can be prepared as described in Section II, Part D, above. Moreover, homologous 
recombinant organisms expressing a transcriptional inhibitor fusion protein in cells thereof 
are also encompassed by the invention and can be prepared as described in Section II, Part E, 
above. The invention provides recombinant expression vectors suitable for homologous 
recombination. In one embodiment, such an expression vector comprises a nucleic acid 
molecule encoding a transcriptional inhibitor fusion protein of the invention which is flanked 
at its 5* and 3* ends by additional nucleic acid of a eukaryotic gene, the additional nucleic acid 
being of sufficient length for successful homologous recombination with the eukaryotic gene. 
Vectors and methods for creating homologous recombinant organisms that express the 
components of the regulatory system of the invention, and uses therefor, are described in 
further detail in U.S. Patent Application Serial Number 08/260,452. Preferably, a transgenic 
or homologous recombinant organism of the invention expressing a transcriptional inhibitor 
fusion protein in cells thereof also carries a tet operator-linked gene of interest (i.e., target 
nucleotide sequence to be transcribed) in cells thereof. 

y T Kits of the Invention 

Another aspect of the invention pertains to kits which include the components of the 
inducible regulatory system of the invention. Such a kit can be used to regulate the 
expression of a gene of interest (i.e., a nucleotide sequence of interest to be transcribed) 
which can be cloned into a target transcription unit. The kit may include nucleic acid 
encoding a transcriptional activator fusion protein or a transcriptional inhibitor fusion protein 
or both. Alternatively, eukaryotic cells which have nucleic acid encoding a transactivator 
and/or inhibitor fusion protein stably incorporated therein, such that the transactivator and/or 
inhibitor fusion protein are expressed in the eukaryotic cell, may be provided in the kit. 



In one embodiment, the kit includes a carrier means having in close confinement 
therein at least two container means: a first container means which contains a first nucleic 
acid (e.g., DNA) encoding a transactivator fusion protein of the invention (e.g., a recombinant 
expression vector encoding a first polypeptide which binds to a tet operator sequence in the 
5 presence of tetracycline operatively linked to a second polypeptide which activates 

transcription in eukaryotic cells), and a second container means which contains a second 
target nucleic acid (e.g., DNA) for the transactivator into which a nucleotide sequence of 
interest can be cloned. The second nucleic acid typically comprises a cloning site for 
introduction of a nucleotide sequence to be transcribed (optionally including an operatively 

10 linked minimal promoter sequence) and at least one operatively linked tet operator sequence. 
The term "cloning site" is intended to encompass at least one restriction endonuclease site. 
Typically, multiple different restriction endonuclease sites (e.g., a polylinker) are contained 
within the nucleic acid. 

To regulate expression of a nucleotide sequence of interest using the components of 

15 the kit, the nucleotide sequence is cloned into the cloning site of the target vector of the kit by 
conventional recombinant DNA techniques and then the first and second nucleic acids are 
introduced into a host cell or animal. The transactivator fusion protein expressed in the host 
cell or animal then regulates transcription of the nucleotide sequence of interest in the 
presence of the inducing agent (Tc or analogue thereof). 

20 Alternatively, in another embodiment, the kit includes a eukaryotic cell which is 

stably transfected with a nucleic acid encoding a transactivator fusion protein of the invention 
such that the transactivator is expressed in the cell. Thus, rather than containing nucleic acid 
alone, the first container means described above can contain a eukaryotic cell line into which 
the first nucleic acid encoding the transactivator has been stably introduced (e.g., by stable 

25 transfection by a conventional method such as calcium phosphate precipitation or 

electroporation, etc.). In this embodiment, a nucleotide sequence of interest is cloned into the 
cloning site of the target vector of the kit and then the target vector is introduced into the 
eukaryotic cell expressing the transactivator fusion protein. 

Alternatively or additionally, a recombinant vector of the invention for coordinate 

30 regulation of expression of two nucleotide sequences can also be incorporated into a kit of the 
invention. The vector can be included in the kit in a form that allows for introduction into the 
vector of two nucleotide sequences of interest. Thus, in another embodiment, a kit of the 
invention includes 1) a first nucleic acid encoding a transactivator fusion protein of the 
invention (or a eukaryotic cell into which the nucleic acid has been stably introduced) and 2) 

35 a second nucleic acid comprising a nucleotide sequence comprising in a 5' to 3' direction: a 
first cloning site for introduction of a first nucleotide sequence of interest operatively linked 
to at least one tet operator sequence operatively linked to a second cloning site for 
introduction of a second nucleotide sequence of interest, wherein transcription of the first and 
second nucleotide sequences proceeds in opposite directions from the at least one tet operator 




sequence. Optionally, the vector can include operatively linked minimal promoter sequences. 
In another embodiment, the vector can be in a form that already contains one nucleotide 
sequence to be transcribed (e.g., encoding a detectable marker such as luciferase, p- 
galactosidase or CAT) and a cloning site for introduction of a second nucleotide sequence of 
5 interest to be transcribed. 

The transcription units and transactivators of the invention for independent regulation 
of expression of two nucleotide sequences to be transcribed can also be incorporated into a kit 
of the invention. The target transcription units can be in a form which allows for introduction 
into the transcription units of nucleotide sequences of interest to be transcribed. Thus, in 

1 0 another embodiment, a kit of the invention includes 1) a first nucleic acid encoding a 

transactivator which binds to a tet operator of a first class type in the presence of Tc or an 
analogue thereof, 2) a second nucleic acid comprising a first cloning site for introduction of a 
first nucleotide sequence to be transcribed operatively linked to at least one tet operator of a 
first class type, 3) a third nucleic acid encoding a transactivator which binds to a tet operator 

15 of a second class type in the absence of Tc or an analogue thereof, , and 4) a fourth nucleic 
acid comprising a second cloning site for introduction of a second nucleotide sequence to be 
transcribed operatively linked to at least one tet operator of a second class type. (Optionally, 
minimal promoter sequences are included in the second and fourth nucleic acids). In another 
embodiment, one nucleotide sequence to be transcribed (e.g., encoding a suicide gene) is 

20 already contained in either the second or the fourth nucleic acid. In yet another embodiment, 
the nucleic acids encoding the transactivators (e.g., the first and third nucleic acids described 
above) can be stably introduced into a eukaryotic cell line which is provided in the kit. 

In yet another embodiment, a kit of the invention includes a first container means 
containing a first nucleic acid encoding a transcriptional inhibitor fusion protein of the 

25 invention (e.g., the fusion protein inhibits transcription in eukaryotic cells either only in the 
presence of Tc or only the absence of Tc) and a second container means containing a second 
nucleic acid comprising a cloning site for introduction of a nucleotide sequence to be 
transcribed operatively linked to at least one tet operator sequence. The kit may further 
include a third nucleic acid encoding a transactivator fusion protein that binds to tetO 

30 sequences either only in the presence of Tc or only in the absence of Tc. Alternatively, the 
first and/or third nucleic acids (i.e., encoding the inhibitor or transactivator fusion proteins) 
may be stably incorporated into a eukaryotic host cell which is provided in the kit. 

In still another embodiment, a kit of the invention may include at least one 
tetracycline or tetracycline analogue. For example, the kit may include a container means 

35 which contains tetracycline, anhydrotetracycline, doxycycline, epioxytetracycline or other 
tetracycline analogue described herein. 




VI. Regulation of Gene E xpression bv Tetracycline or Analogues Thereof 

A. Stimulation of Gene Expression by Transactivator Fusion Proteins 

In a host cell which carries nucleic acid encoding a transactivator fusion protein of the 
5 invention and a nucleotide sequence operatively linked to the tet operator sequence(i.e., gene 
of interest to be transcribed), high level transcription of th& nucleotide sequence operatively 
linked to the tet operator sequence(s) does not occur in the absence of the inducing agent, 
tetracycline or analogues thereof The level of basal transcription of the nucleotide sequence 
may vary depending upon the host cell and site of integration of the sequence, but is 

10 generally quite low or even undetectable in the absence of Tc. In order to induce 

transcription in a host cell, the host cell is contacted with tetracycline or a tetracycline 
analogue. Accordingly, another aspect of the invention pertains to methods for stimulating 
transcription of a nucleotide sequence operatively linked to a tet operator sequence in a host 
cell or animal which expresses a transactivator fusion protein of the invention. The methods 

1 5 involve contacting the cell with tetracycline or a tetracycline analogue or administering 
tetracycline or a tetracycline analogue to a subject containing the cell. 

The term "tetracycline analogue" is intended to include compounds which are 
structurally related to tetracycline and which bind to the Tet repressor with a K a of at least 
about 10 6 M" 1 . Preferably, the tetracycline analogue binds with an affinity of about 10 9 

20 M" 1 or greater. Examples of such tetracycline analogues include, but are not limited to, 

anhydrotetracycline, doxycycline, chlorotetracycline, oxytetracycline and others disclosed by 
Hlavka and Boothe, "The Tetracyclines," in Handbook of Experimental Pharmacology 78, 
R.K. Blackwood et al. (eds.), Springer- Verlag, Berlin-New York, 1985; L.A. Mitscher, "The 
Chemistry of the Tetracycline Antibiotics", Medicinal Research 9, Dekker, New York, 1978; 

25 Noyee Development Corporation, "Tetracycline Manufacturing Processes" Chemical Process 
Reviews, Park Ridge, NJ, 2 volumes, 1969; R.C. Evans, "The Technology of the 
Tetracyclines", Biochemical Reference Series 7, Quadrangle Press, New York, 1968; and 
H.F. Dowling, "Tetracycline", Antibiotic Monographs, no. 3, Medical Encyclopedia, New 
York, 1955. Preferred Tc analogues for high level stimulation of transcription are 

30 anhydrotetracycline and doxycycline. A Tc analogue can be chosen which has reduced 

antibiotic activity compared to Tc. Examples of such Tc analogues are anhydrotetracycline, 
epioxytetracycline and cyanotetracycline. 

To induce gene expression in a cell in vitro, the cell is contacted with Tc or a Tc 
analogue by culturing the cell in a medium containing the compound. When culturing cells 

35 in vitro in the presence of Tc or Tc analogue, a preferred concentration range for the inducing 
agent is between about 10 and about 1000 ng/ml. Tc or a Tc analogue can be directly added 
to media in which cells are already being cultured, or more preferably for high levels of gene 
induction, cells are harvested from Tc-free media and cultured in fresh media containing Tc, 
or an analogue thereof. 




To induce gene expression in vivo, cells within in a subject are contacted with Tc or a 
Tc analogue by administering the compound to the subject. The term "subject" is intended to 
include humans and other non-human mammals including monkeys, cows, goats, sheep, 
dogs, cats, rabbits, rats, mice, and transgenic and homologous recombinant species thereof. 
5 Furthermore, the term "subject" is intended to include plants, such as transgenic plants. 

When the inducing agent is administered to a human or animal subject, the dosage is adjusted 
to preferably achieve a serum concentration between about 0.05 and 1.0 jig/ml. Tc or a Tc 
analogue can be administered to a subject by any means effective for achieving an in vivo 
concentration sufficient for gene induction. Examples of suitable modes of administration 

10 include oral administration (e.g., dissolving the inducing agent in the drinking water), slow 
release pellets and implantation of a diffusion pump. To administer Tc or a Tc analogue to a 
transgenic plant, the inducing agent can be dissolved in water administered to the plant. 

The ability to use different Tc analogues as inducing agents in this system allows for 
modulate the level of expression of a tet operator-linked nucleotide sequence. As 

1 5 demonstrated in Example 2, anhydrotetracycline and doxycycline have been found to be 
strong inducing agents. The increase in transcription of the target sequence is typically as 
high as 1 000- to 2000-fold, and induction factors as high as 20,000 fold can be achieved. 
Tetracycline, chlorotetracycline and oxytetracycline h^ive been found to be weaker inducing 
agents, i.e., the increase in transcription of a target sequence is in the range of about 10-fold. 

20 Thus, an appropriate tetracycline analogue is chosen as an inducing agent based upon the 
desired level of induction of gene expression. It is also possible to change the level of gene 
expression in a host cell or animal over time by changing the Tc analogue used as the 
inducing agent. For example, there may be situations where it is desirable to have a strong 
burst of gene expression initially and then have a sustained lower level of gene expression. 

25 Accordingly, an analogue which stimulates a high levels of transcription can be used initially 
as the inducing agent and then the inducing agent can be switched to an analogue which 
stimulates a lower level of transcription. Moreover, when regulating the expression of 
multiple nucleotide sequences (e.g., when one sequence is regulated by a one of class tet 
operator sequence(s) and the other is regulated by another class of tet operator sequence(s), as 

30 described above in Section III, Part C, above), it may be possible to independently vary the 
level of expression of each sequence depending upon which transactivator fusion protein is 
used to regulate transcription and which Tc analogue(s) is used as the inducing agent. 
Different transactivator fusion proteins are likely to exhibit different levels of responsiveness 
to Tc analogues. The level of induction of gene expression by a particular combination of 

35 transactivator fusion protein and inducing agent (Tc or Tc analogue) can be determined by 

techniques described herein, (e.g., see Example 2). Additionally, the level of gene expression 
can be modulated by varying the concentration of the inducing agent. Thus, the expression 
system of the invention provides a mechanism not only for turning gene expression on or off, 




but also for "fine tuning" the level of gene expression at intermediate levels depending upon 
the type and concentration of inducing agent used. 

B. Inhibition of Gene Expression by Transcriptional Inhibitor Fusion Proteins 
5 The invention also provides methods for inhibiting gene expression using the 

transcriptional inhibitor fusion proteins of the invention. These methods can be used to 
down-regulate basal, constitutive or tissue-specific transcription of a te/O-linked gene of 
interest. For example, a gene of interest that is operatively linked to tetO sequences and 
additional positive regulatory elements (e.g., consitutive or tissue-specific enhancer 

10 sequences) will be transcribed in host cells at a level that is primarily determined by the 

strength of the positive regulatory elements in the host cell. Moreover, a gene of interest that 
is operatively linked to te/Osequences and only a minimal promoter sequence may exhibit 
varying degrees of basal level transcription depending on the host cell or tissue and/or the site 
of integration of the sequence. In a host cell containing such a target sequence and 

15 expressing an inhibitor fusion protein of the invention, transcription of the target sequence 
can be down regulated in a controlled manner by altering the concentration of Tc (or 
analogue) in contact with the host cell. For example, when the inhibitor fusion protein binds 
to tetO in the absence of Tc, the concentration of Tc in contact with the host cell is reduced to 
inhibit expression of the target gene. Preferably, a host cell is cultured in the absence of Tc to 

20 keep target gene expression repressed. Likewise, Tc is not administered to a host organism to 
keep target gene expression repressed. Alternatively, when the inhibitor fusion protein binds 
to tetO in the presence of Tc, the concentration of Tc in contact with the host cell is increased 
to inhibit expression of the target gene. For example, Tc is added to the culture medium of a 
host cell or Tc is administered to a host organism to repress target gene expression. 

25 The inhibitor fusion proteins described herein can inhibit a tefO-linked gene of 

interest in which the tetO sequences are positioned 5 f of a minimal promoter sequence (e.g., 
tetracycline-regulated transcription units as described in Section III, above). Furthermore, the 
inhibitor fusion protein may be used to inhibit expression of a gene of interest in which tetO- 
linked sequences are located 3 f of the promoter sequence but 5* of the transcription start site. 

30 Still further, the inhibitor fusion protein may be used to inhibit expression of a gene of 
interest in which tefO-linked sequences are located 3' of the transcription start site. 

Various Tc analogues as described in Section VI, part A, above, with respect to the 
transactivator fusion proteins can similarly be used to regulate the activity of the inhibitor 
fusion proteins. Moreover, the methods of in vitro culture with Tc (or analogue) and in vivo 

35 administration of Tc (or analogue) described in Section VI, part A, are equally applicable to 
the transcriptional inhibitor fusion proteins. 




C. Combined Positive and Negative Regulation of Gene Expression 

In addition to regulating gene expression using either a transcriptional activator or 
inhibitor fusion protein alone, the two types of fusion proteins can be used in combination to 
allow for both positive and negative regulation of expression of one or more target genes in a 
5 host cell. Thus, a transcriptional inhibitor protein that binds to tetO either (i) in the absence, 
but not the presence, of Tc, or (ii) in the presence, but not the absence, of Tc, can be used in 
combination with a transactivator protein that binds to tetO either (i) in the absence, but not 
the presence, of Tc, or (ii) in the presence, but not the absence, of Tc. Transactivator proteins 
that bind to tetO in the absence, but not the presenc, of Tc (e.g., wild-type TetR-activator 

10 fusion proteins) are described in further detail in USSN 08/076,726, USSN 08/076,327 and 
USSN 08/260,452. Transactivator fusion proteins that bind to tetO in the presence, but not 
the absence, of Tc (e.g., mutated TetR-activator fusion proteins) are described herein (see 
Section I above) and in USSN 08/270,637 and USSN 08/275,876. Transcriptional inhibitor 
fusion proteins are described herein in Section IV. 

15 As described above in Section III, Part C, when more than one TetR fusion protein is 

expressed in a host cell or organism, additional steps may be taken to inhibit 
heterodimerization between the different TetR fusion proteins. For example, a transactivator 
composed of a TetR of one class may be used in combination with a transcriptional inhibitor 
composed of a TetR of a second, different class that does not heterodimerize with the first 

20 class of TetR. Alternatively, amino acid residues of the TetR involved in dimerization may 
be mutated to inhibit heterodimerization. However, even if some heterodimerization between 
transactivator and inhibitor fusion proteins occurs in a host cell, sufficient amounts of 
homodimers should be produced to allow for efficient positive and negative regulation as 
described herein. 

25 It will be appreciated by those skilled in the art that various combinations of activator 

and inhibitor proteins can be used to regulate a single te/O-linked gene of interest in both a 
positive and negative manner or to regulate multiple te/O-linked genes of interest in a 
coordinated manner or in an independent manner using the teachings described herein. The 
precise regulatory components utilized will depend upon the genes to be regulated and the 

30 type of regulation desired. Several non-limiting examples of how the transactivator and 

inhibitor fusion proteins may be used in combination are described further below. However, 
many other possible combinations will be evident to the skilled artisan in view of the 
teachings herein and are intended to be encompassed by the invention. 

In a preferred embodiment, illustrated schematically in Figure 10, expression of a 

35 tefO-linked target gene of interest in a host cell is regulated in both a negative and positive 
manner by the combination of an inhibitor fusion protein that binds to tetO in the absence, 
but not the presence, of tetracycline or analogue thereof (referred to as a tetracycline 
controlled silencing domain, or tSD) and an activator fusion protein that binds to tetO in the 
presence, but not the absence, of tetracycline or analogue thereof (referred to as a reverse 




tetracycline controlled transactivator, or rtTA). In addition to tetO sequences, the target gene 
is linked to a promoter, and may contain other positive regulatory elements (e.g., enhancer 
sequences) that contribute to basal level, constitutive transcription of the gene in the host cell. 
Binding of tSD to the tetO sequences in the absence of tetracycline or analogue (e.g., 
5 doxycycline) inhibits the basal constitutive transcription of the gene of interest, thus keeping 
the gene of interest in a repressed state until gene expression is desired. When gene 
expression is desired, the concentration of tetracycline or analogue (e.g., doxycycline) in 
contact with the host cell increased. Upon addition of the drug, tSD loses the ability to bind 
to tetO sequences whereas the previously unbound rtTA acquires the ability to bind to tetO 

10 sequences. The resultant binding of rtTA to the tetO sequences linked to the gene of interest 
thus stimulates transcription of the gene of interest. The level of expression may be 
controlled by the concentration of tetracycline or analogue, the type of Tc analogue used, the 
duration of induction, etc., as described previously herein. It will be appreciated that the 
reverse combination of fusion proteins (i.e., the inhibitor binds in the presence but not the 

1 5 absence of the drug and the activator binds in the absence but not the presence of the drug) 
can also be used. In this case, expression of the gene of interest is kept repressed by 
contacting the host cell with the drug (e.g., culture with Tc or analogue) and gene expression 
is activated by removal of the drug. 

In another embodiment, the activator and inhibitor fusion proteins,as described in the 

20 previous paragraph, are used in combination to coordinately regulate, in both a positive and 
negative manner, two genes of interest using the bidirectional tefO-linked transcription unit 
described in Section III, Part B above. In this case, Gene 1 and Gene 2 are linked to the same 
tetO sequence(s), but in opposite orientations. The inhibitor fusion protein is used to repress 
basal levels of transcription of both Gene 1 and Gene2 in a coordinate manner, whereas the 

25 transactivator fusion protein is used to stimulate expression of Gene 1 and Gene 2 in a 
coordinate manner. 

In yet another embodiment, the activator and inhbitor fusion proteins are used to 
independently regulate two or more genes of interest using the te/O-linked transcription units 
as described in Section III, Part C above. For example, in one embodiment, a transactivator 

30 fusion protein that binds to one class of tetO sequences (e.g., class A) in the presence, but not 
the absence of Tc or analogue is used in combination with an inhibitor fusion protein that 
binds to a second, different class of tetO sequences (e.g., class B) also in the presence, but not 
the absence, of Tc or analogue. In a host cell containing Gene 1 linked to class A tetO 
sequences and Gene 2 linked to class B tetO sequences, both genes will be expressed at basal 

35 levels in the absence of the drug, whereas expression of Gene 1 will be stimulated upon 
addition of the drug and expression of Gene 2 will be repressed upon addition of the drug. 

Alternatively, in another embodiment, the transactivator binds to one class of tetO 
sequences (e.g., class A) in the presence, but not the absence, of Tc or analogue and the 
inhibitor fusion protein binds to a second, different class of tetO sequences (e.g., class B) in 
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the absence but not the presence of Tc or analogue. In the host cell as described in the 
previous paragraph, Gene 1 will be expressed at basal levels in the absence of the drug and 
will be stimulated upon addition of the drug, whereas Gene 2 will be repressed in the absence 
of the drug but will have basal levels expression upon addition of the drug. Various other 
possible combinations will be apparent to the skilled artisan. Transactivator and inhibitor 
fusion proteins that bind to different classes of tetO sequences can be prepared as described in 
Section I, Part A. Target transcription units comprising tetO sequences of different classes 
can be prepared as described in Section III, Part C. 

VII. Applications of the Invention 

The invention is widely applicable to a variety of situations where it is desirable to be 
able to turn gene expression on and off, or regulate the level of gene expression, in a rapid, 
efficient and controlled manner without causing pleiotropic effects or cytotoxicity. Thus, the 
system of the invention has widespread applicability to the study of cellular development and 
differentiation in eukaryotic cells, plants and animals. For example, expression of oncogenes 
can be regulated in a controlled manner in cells to study their function. Additionally, the 
system can be used to regulate the expression of site-specific recombinases, such as CRE or 
FLP, to thereby allow for irreversible modification of the genotype of a transgenic organism 
under controlled conditions at a particular stage of development. For example, drug 
resistance markers inserted into the genome of transgenic plants that allow for selection of a 
particular transgenic plant could be irreversibly removed via a Tc-regulated site specific 
recombinase. Other applications of the regulatory system of the invention include: 

A. Gene Therapy 

The invention may be particularly useful for gene therapy purposes, in treatments for 
either genetic or acquired diseases. The general approach of gene therapy involves the 
introduction of nucleic acid into cells such that one or more gene products encoded by the 
introduced genetic material are produced in the cells to restore or enhance a functional 
activity. For reviews on gene therapy approaches see Anderson, W.F. (1992) Science 
256:808-813; Miller, A.D. (1992) Nature 357:455-460; Friedmann, T. (1989) Science 
244:1275-1281; and Cournoyer, D., et al. (1990) Curr. Opin. Biotech. 1:196-208. However, 
current gene therapy vectors typically utilize constitutive regulatory elements which are 
responsive to endogenous transcriptions factors. These vector systems do not allow for the 
ability to modulate the level of gene expression in a subject. In contrast, the inducible 
regulatory system of the invention provides this ability. 

To use the system of the invention for gene therapy purposes, in one embodiment, 
cells of a subject in need of gene therapy are modified to contain 1) nucleic acid encoding a 
transactivator fusion protein of the invention in a form suitable for expression of the 



transactivator in the host cells and 2) a gene of interest (e.g., for therapeutic purposes) 
operatively linked to a tet operator sequence(s). The cells of the subject can be modified ex 
vivo and then introduced into the subject or the cells can be directly modified in vivo 
(methods for modification of the cells are described above in Section II). Expression of the 
5 gene of interest in the cells of the subject is then stimulated by administering Tc or a Tc 
analogue to the patient. The level of gene expression can be varied depending upon which 
particular Tc analogue is used as the inducing agent. The level of gene expression can also 
be modulated by adjusting the dose of the tetracycline, or analogue thereof, administered to 
the patient to thereby adjust the concentration achieved in the circulation and the tissues of 
10 interest. 

Moreover, in another embodiment, a transcriptional inhibitor fusion protein is used to 
further control the level of expression of the gene of interest. For example, the cells of the 
subject can be modified to also contain a nucleic acid encoding a transcriptional inhibitor 
fusion protein that binds to tetO in the absence of Tc. The nucleic acid is in a form suitable 

1 5 for expression of the inhibitor fusion protein in the host cells. Thus, prior to administration 
of Tc (or analogue) to the subject, the basal level of transcription of the gene of interest will 
be kept silent by the inhibitor fusion protein. Upon administration of Tc, binding of the 
inhibitor fusion protein to tetO will be inhibited whereas binding of the transactivator fusion 
will be induced, thereby stimulating transcription of the gene of interest. Such combined 

20 positive and negative regulation of gene expression using both a transactivator fusion protein 
and transcriptional inhibitor fusion protein of the invention is illustrated schematically in 
Figure 10. 

Conventional detection methods known in the art, such as an enzyme linked 
immunosorbent assay, can be used to monitor the expression of the regulated protein of 

25 interest in the host cells and the concentration of Tc or Tc analogue can be varied until the 

desired level of expression of the protein of interest is achieved. Accordingly, expression of a 
protein of interest can be adjusted according to the medical needs of an individual, which 
may vary throughout the lifetime of the individual. To stop expression of the gene of interest 
in cells of the subject, administration of the inducing agent is stopped. Thus, the regulatory 

30 system of the invention offers the advantage over constitutive regulatory systems of allowing 
for modulation of the level of gene expression depending upon the requirements of the 
therapeutic situation. 

Genes of particular interest to be expressed in cells of a subject for treatment of 
genetic or acquired diseases include those encoding adenosine deaminase, Factor VIII, Factor 

35 IX, dystrophin, p-globin, LDL receptor, CFTR, insulin, erythropoietin, anti-angiogenesis 
factors, growth hormone, glucocerebrosidase, P-glucouronidase, a 1 -antitrypsin, 
phenylalanine hydroxylase, tyrosine hydroxylase, ornithine transcarbamylase, 
arginosuccinate synthetase, UDP-glucuronysyl transferase, apoAl, TNF, soluble TNF 
receptor, interleukins (e.g., IL-2), interferons (e.g., a- or y-IFN) and other cytokines and 




growth factors. Cells types which can be modified for gene therapy purposes include 
hematopoietic stem cells, myoblasts, hepatocytes, lymphocytes, skin epithelium and airway 
epithelium. For further descriptions of cell types, genes and methods for gene therapy see 
e.g., Wilson, J.M et al. (1988) Proc. Natl Acad Sci. USA ££3014-3018; Amientano, D. et al. 
5 (1990) Proc. Natl. Acad. Set USA £2:6141-6145; Wolff, J.A. et al. (1990) Science 242:1465- 
1468; Chowdhury, J.R. et al. (1991) Science 254:1802-1805; Ferry, N. et al. (1991) Proc. 
Natl Acad. Sci. USA ££8377-8381; Wilson, J.M. et al. (1992),/ Biol Chem. 262:963-967; 
Quantin, B. et al. (1992) Proc. Natl Acad. Sci. USA £2:2581-2584; Dai, Y. et al. (1992) 
Proc. Natl Acad. Sci. USA £2:10892-10895; van Beusechem, V.W. et al. (1992) Proc. Natl 

10 Acad. Sci. USA £2:7640-7644; Rosenfeld, M.A. et al. (1992) Cell 143-1 55; Kay, M.A. et 
al. (1992) Human Gene Therapy 2:641-647; Cristiano, RJ. et al. (1993) Proc. Natl Acad. 
Sci. USA 2&2122-2126; Hwu, P. et al. (1993) J. Immunol 15Q:41 04-41 15; and Herz, J. and 
Gerard, R.D. (1993) Proc. Natl. Acad. Sci. USA 20:2812-2816. 

Gene therapy applications of particular interest in cancer treatment include 

15 overexpression of a cytokine gene (e.g., TNF-a) in tumor infiltrating lymphocytes or ectopic 
expression of cytokines in tumor cells to induce an anti-tumor immune response at the tumor 
site), expression of an enzyme in tumor cells which can convert a non-toxic agent into a toxic 
agent, expression of tumor specific antigens to induce an anti-tumor immune response, 
expression of tumor suppressor genes (e.g., p53 or Rb) in tumor cells, expression of a 

20 multidrug resistance gene (e.g., MDR1 and/or MRP) in bone marrow cells to protect them 
from the toxicity of chemotherapy. 

Gene therapy applications of particular interest in treatment of viral diseases include 
expression of trans-dominant negative viral transactivation proteins, such as trans-dominant 
negative tat and rev mutants for HIV or trans-dominant ICp4 mutants for HSV (see e.g., 

25 Balboni, P.G. et al. (1993) J. Med. Virol. 41:289-295; Liem, S.E. et al. (1993) Hum. Gene 
Ther. 4:625-634; Malim, M.H. et al. (1992) J. Exp. Med. 1 197-1201 ; Daly, TJ. et al. 
(1993) Biochemistry 12:8945-8954; and Smith, C.A. et al. (1992) Virology 121:581-588), 
expression of trans-dominant negative envelope proteins, such as env mutants for HIV (see 
e.g., Steffy, K.R. et al. (1993) J. Virol 62:1854-1859), intracellular expression of antibodies, 

30 or fragments thereof, directed to viral products ("internal immunization", see e.g., Marasco, 
W.A. et al. (1993) Proc. Natl Acad. Sci. USA 20:7889-7893) and expression of soluble viral 
receptors, such as soluble CD4. Additionally, the system of the invention can be used to 
conditionally express a suicide gene in cells, thereby allowing for elimination of the cells 
after they have served an intended function. For example, cells used for vaccination can be 

35 eliminated in a subject after an immune response has been generated the subject by inducing 
expression of a suicide gene in the cells by administering Tc or a Tc analogue to the subject. 

The Tc-controlled regulatory system of the invention has numerous advantages 
properties that it particularly suitable for application to gene therapy. For example, the 
system provides an M on"/"off 1 switch for gene expression that allows for regulated dosaging a 



gene product in a subject. There are several situations in which it may be desirable to be able 
to provide a gene product at specific levels and/or times in a regulated manner, rather than 
simply expressing the gene product constitutively at a set level. For example, a gene of 
interest can be switched "on" at fixed intervals (e.g., daily, alternate days, weekly, etc.) to 
5 provide the most effective level of a gene product of interest at the most effective time. The 
level of gene product produced in a subject can be monitored by standard methods (e.g., 
direct monitoring using an immunological assay such as ELISA or RIA or indirectly by 
monitoring of a laboratory parameter dependent upon the function of the gene product of 
interest, e.g., blood glucose levels and the like). This ability to turn M on" expression of a gene 
10 at discrete time intervals in a subject while also allowing for the gene to be kept "off 1 at other 
times avoids the need for continued administration of a gene product of interest at 
intermittent intervals. This approach avoids the need for repeated injections of a gene 
product, which may be painful and/or cause side effects and would likely require continuous 
visits to a physician. In contrast, the system of the invention avoids these drawbacks. 
15 Moreover, the ability to turn "on" expression of a gene at discrete time intervals in a subject 
allows for focused treatment of diseases which involve "flare ups" of activity (e.g., many 
autoimmune diseases) only at times when treatment is necessary during the acute phase when 
pain and symptoms are evident. At times when such diseases are in remission, the expression 
system can be kept in the "off 1 state. 
20 Gene therapy applications that may particularly benefit from this ability to modulate 

gene expression during discrete time intervals include the following non-limiting examples: 
Rheumatoid arthritis - genes which encode gene products that inhibit the production 
of inflammatory cytokines (e.g., TNF, IL-1 and IL-12). can be expressed in subjects. 
Examples of such inhibitors include soluble forms of a receptor for the cytokine. 
25 Additionally or alternatively, the cytokines IL-10 and/or IL-4 (which stimulate a protective 
Th2-type response) can be expressed. Moreover, a glucocorticomimetic receptor (GCMR) 
can be expressed. 

Hypopituitarism - the gene for human growth hormone can be expressed in such 
subjects only in early childhood, when gene expression is necessary, until normal stature is 
30 achieved, at which time gene expression can be downregulated. 

Wound healing/Tissue regeneration - Factors (e.g., growth factors, angiogenic factors, 
etc.) necessary for the healing process can be expressed only when needed and then 
downregulated. 

Anti-Cancer Treatments - Expression of gene products useful in anti-cancer treatment 
35 can be limited to a therapeutic phase until retardation of tumor growth is achieved, at which 
time expression of the gene product can be downregulated. Possible systemic anti-cancer 
treatments include use of tumor infiltrating lymphocytes which express immunostimulatory 
molecules (e.g., IL-2, IL-12 and the like), angiogenesis inhibitors (PF4, IL-12, etc.), Her- 
regulin, Leukoregulin (see PCT Publication No. WO 85/04662), and growth factors for bone 




marrow support therapy, such as G-CSF, GM-CSF and M-CSF. Regarding the latter, use of 
the regulatory system of the invention to express factors for bone marrow support therapy 
allows for simplified therapeutic switching at regular intervals from chemotherapy to bone 
marrow support therapy (similarly, such an approach can also be applied to AIDS treatment, 
5 e.g., simplified switching from anti-viral treatments to bone marrow support treatment). 
Furthermore, controlled local targeting of anti-cancer treatments are also possible. For 
example, expression of a suicide gene by a regulator of the invention, wherein the regulator 
itself is controlled by, for example, a tumor-specific promoter or a radiation-induced 
promoter. 

10 In another embodiment, the regulatory system of the invention is used to express 

angiogenesis inhibitors) from within a tumor via a transgene regulated by the system of the 
invention. Expression of angiogenesis inhibitors in this manner may be more efficient than 
systemic administration of the inhibitor and would avoid any deleterious side effects that 
might accompany systemic administration. In particular, restricting angiogenesis inhibitor 

1 5 expression to within tumors could be particularly useful in treating cancer in children still 
undergoing angiogenesis associated with normal cell growth. 

In another embodiment, high level regulated expression of cytokines may represent a 
method for focusing a patients own immune response on tumor cells. Tumor cells can be 
transduced to express chemoattractant and growth promoting cytokines important in 

20 increasing an individual's natural immune response. Because the highest concentrations of 
cytokines will be in the proximity of the tumor, the likelihood of eliciting an immunological 
response to tumor antigens is increased. A potential problem with this type of therapy is that 
those tumor cells producing the cytokines will also be targets of the immune response and 
therefor the source of the cytokines will be eliminated before eradication of all tumor cells 

25 can be certain. To combat this, expression of viral proteins known to mask infected cells 

from the immune system can be placed under regulation, along with the cytokine gene(s), in 
the same cells. One such protein is the El 9 protein from adenovirus (see e.g., Cox, Science 
247:715). This protein prevents transport of class I HLA antigens to the surface of the cell 
and hence prevents recognition and lysis of the cell by the host's cytotoxic T cells. 

30 Accordingly, regulated expression of El 9 in tumor cells could shield cytokine producer cells 
from cytotoxic T cells during the onset of an immune response provoked by cytokine 
expression. After a sufficient period of time has elapsed to eradicate all tumor cells but those 
expressing El 9, El 9 expression can be turned off, causing these cells then to fall victim to 
the provoked anti-tumor immune response. 

35 Benign prostatic hyp ertrophy - Similar to the above, a suicide gene can be regulated 

by a regulator of the invention, wherein the regulator itself is controlled by, for example, a 
prostate-specific promoter. 

The ability to express a suicide gene (e.g., an apoptosis gene, TK gene, etc) in a 
controlled manner using the regulatory system of the invention adds to the general safety and 




usefulness of the system. For example, at the end of a desired therapy, expression of a 
suicide gene can be triggered to eliminate cells carrying the gene therapy vector, such as cells 
in a bioinert implant, cells that have disseminated beyond the intended original location, etc. 
Moreover, if a transplant becomes tumorous or has side effects, the cells can be rapidly 
5 eliminated by induction of the suicide gene. The use of more than one Tc-controlled 

"on'Voff" switch in one cell allows for completely independent regulation of a suicide gene 
compared to regulation of a gene of therapeutic interest (as described in detail herein). 

The regulatory system of the invention further offers the ability to establish a 
therapeutically relevant expression level for a gene product of interest in a subject, in contrast 

10 to unregulated constitutive expression which offers no flexibility in the level of gene product 
expression that can be achieved. A physiologically relevant level of gene product expression 
can be established based on the particular medical need of the subject, e.g., based on 
laboratory tests that monitor relevant gene product levels (using methods as described above). 
In addition to the clinical examples and gene products already discussed above with gene to 

1 5 dosaging of the gene product, other therapeutically relevant gene products which can be 
expressed at a desired level at a desired time include: Factor XIII and IX in hemophiliacs 
(e.g., expression can be elevated during times of risk of injury, such as during sports); insulin 
or amylin in diabetics (as needed, depending on the state of disease in the subject, diet, etc.); 
erythropoietin to treat erythrocytopenia (as needed, e.g., at end-stage renal failure); low- 

20 density lipoprotein receptor (LDLr) or very low-density lipoprotein receptor (VLDLr) for 
artherosclerosis or gene therapy in liver (e.g, using ex vivo implants). Applications to 
treatment of central nervous system disorders are also encompassed. For example, in 
Alzheimer's disease, "fine tuned" expression of choline acetyl transferase (ChAT) to restore 
acetylcholine levels, neurotrophic factors (e.g., NGF, BDNGF and the like) and/or 

25 complement inhibitors (e.g., sCRl, sMCP, sDAF, sCD59 etc.) can be accomplished. Such 
gene products can be provided, for example, by transplanted cells expressing the gene 
products in a regulated manner using the system of the invention. Moreover, Parkinson's 
disease can be treated by "fine tuned" expression of tyrosine hydroxylase (TH) to increase 
levodopa and dopamine levels. 

30 In addition to the proteinaceous gene products discussed above, gene products that are 

functional RNA molecules (such as anti-sense RNAs and ribozymes) can be expressed in a 
controlled manner in a subject for therapeutic purposes. For example, a ribozyme can be 
designed which discriminates between a mutated form of a gene and a wild-type gene. 
Accordingly, a "correct" gene (e.g., a wild-type p53 gene) can be introduced into a cell in 

35 parallel with introduction of a regulated ribozyme specific for the mutated form of the gene 
(e.g., a mutated endogenous p53 gene) to remove the defective mRNA expressed from the 
endogenous gene. This approach is particularly advantageous in situations in which a gene 
product from the defective gene would interfere with the action of the exogenous wild-type 
gene. 




Expression of a gene product in a subject using the regulatory system of the invention 
is modulated using tetracycline or analogues thereof. Such drugs can be administered by any 
route appropriate for delivery of the drug to its desired site of action (e.g., delivery to cells 
containing a gene whose expression is to be regulated). Depending on the particular cell 
5 types involved, preferred routes of administration may include oral administration, 

intravenous administration and topical administration (e.g. y using a transdermal patch to reach 
cells of a localized transplant under the skin, such as keratinocytes, while avoiding any 
possible side effects from systemic treatment). 

In certain gene therapy situations, it may be necessary or desirable to take steps to 

10 avoid or inhibit unwanted immune reactions in a subject receiving treatment. To avoid a 
reaction against the cells expressing the therapeutic gene product, a subject's own cells are 
generally used, when possible, to express the therapeutic gene product, either by in vivo 
modification of the subject's cells or by obtaining cells from the subject, modifying them ex 
vivo and returning them to the subject. In situations where allogeneic or xenogeneic cells are 

1 5 used to express a gene product of interest, the regulatory system of the invention, in addition 
to regulating a therapeutic gene, can also be used to regulate one or more genes involved in 
the immune recognition of the cells to inhibit an immune reaction against the foreign cells. 
For example, cell-surface molecules involved in recognition of a foreign cell by T 
lymphocytes can be downmodulated on the surface of a foreign cell used for delivery of a 

20 therapeutic gene product, such as by regulated expression in the foreign cell of a ribozyme 
which cleaves the mRNA encoding the cell-surface molecule. Particularly preferred cell 
surface molecules which can be downmodulated in this manner to inhibit an unwanted 
immune response include class I and/or class II major histocompatibility complex (MHC) 
molecules, costimulatory molecules (e.g., B7-1 and/or B7-2), CD40, and various "adhesion" 

25 molecules, such as ICAM-1 or ICAM-2. Using approaches described herein for independent 
but coordinate regulation of multiple genes in the same cell, the down-regulation of 
expression of a cell-surface molecule(s) in a host cell can be coordinated with the up- 
regulation of expression of a therapeutic gene. Accordingly, after therapy is completed and 
expression of the therapeutic gene is halted, expression of the endogenous cell surface 

30 molecule(s) can be restored to normal. 

Furthermore, as described above regarding anti-cancer treatments, a viral protein (e.g., 
adenovirus El 9 protein) that downmodulates expression of MHC antigens can be regulated in 
host cells using the system of the invention as a means of avoiding unwanted immunological 
reactions. 

35 In addition to avoiding or inhibiting an immune response against a foreign cell 

delivering a therapeutic gene product, it may also be necessary, in certain situations, to avoid 
or inhibit an immune response against certain components of the regulatory system of the 
invention (e.g., the regulator fusion proteins described herein) that are expressed in a subject, 
since these fusion proteins contain non-mammalian polypeptides that may stimulate an 




unwanted immune reaction. In this regard, regulator fusion proteins can be designed and/or 
selected for a decreased ability to stimulate an immune response in a host. For example, a 
transcriptional activator domain for use in the regulator fusion protein can be chosen which 
has minimal immunogenicity. In this regard, a wild-type transcriptional activation domain of 
5 the herpes simplex virus protein VP1 6 may not be a preferred transcriptional activation 

domain for use in vivo, since it may stimulate an immune response in mammals. Alternative 
transcriptional activation domains can be used, as described herein, based on their reduced 
immunogenicity in a subject. For example, a transcriptional activation domain of a protein of 
the same species as the host may be preferred (e.g., a transcriptional activation domain from a 
10 human protein for use of a regulatory fusion protein in humans). Alternatively, a regulatory 
fusion protein of the invention can be modified to reduce its immunogenicity in subjects, e.g., 
by identifying and modifying one or more dominant T cell epitopes within a polypeptide of 
the fusion protein (e.g., either the Tet repressor moiety or the transcriptional modulator 
moiety, such as a VP 16 polypeptide). Such T cell epitopes can be identified by standard 
15 methods and altered by mutagenesis, again by standard methods. A modified form of a 
regulator fusion protein can then be selected which retains its original transcriptional 
regulatory ability yet which exhibits reduced immunogenicity in a subject as compared to an 
unmodified fusion protein. 

In addition to the foregoing, all conventional methods for generally or specifically 
20 downmodulating immune responses in subjects can be combined with the use of the 

regulatory system of the invention in situations where inhibition of immune responses is 
desired. General immunosuppressive agents, such as cyclosporin A and/or FK506, can be 
administered to the subject. Alternatively, immunomodulatory agents which may allow for 
more specific immunosuppression can be used. Such agents may include inhibitors of 
25 costimulatory molecules (e.g., a CTLA4Ig fusion protein, soluble CD4, anti-CD4 antibodies, 
anti-B7-l and/or anti-B7-2 antibodies or anti-gp39 antibodies). 

Finally, in certain situations, a delivery vehicle for cells expressing a therapeutic gene 
can be chosen which minimizes exposure of transplanted cells to the immune system. For 
example, cells can be implanted into bioinert capsules/biocompatible membranes with pores 
30 which allow for diffusion of proteins (e.g., a therapeutic gene product of interest) out of the 
implant and diffusion of nutrients and oxygen into the implant but which prevent entry of 
immune cells, thereby avoiding exposure of the transplanted cells to the immune system (as 
has been applied to islet cell transplantation). 

35 

B. Production of Proteins in Vitro 

Large scale production of a protein of interest can be accomplished using cultured 
cells in vitro which have been modified to contain 1) a nucleic acid encoding a transactivator 
fusion protein of the invention in a form suitable for expression of the transactivator in the 




cells and 2) a gene encoding the protein of interest operatively linked to a tet operator 
sequence(s). For example, mammalian, yeast or fungal cells can be modified to contain these 
nucleic acid components as described herein. The modified mammalian, yeast or fungal cells 
can then be cultured by standard fermentation techniques in the presence of Tc or an analogue 
5 thereof to induce expression of the gene and produce the protein of interest. Accordingly, the 
invention provides a production process for isolating a protein of interest. In the process, a 
host cell (e.g., a yeast or fungus), into which has been introduced both a nucleic acid 
encoding a transactivator fusion protein of the invention and a nucleic acid encoding the 
protein of the interest operatively linked to at least one tet operator sequence, is grown at 

10 production scale in a culture medium in the presence of tetracycline or a tetracycline 
analogue to stimulate transcription of the nucleotides sequence encoding the protein of 
interest (i.e., the nucleotide sequence operatively linked to the tet operator sequence(s)) and 
the protein of interest is isolated from harvested host cells or from the culture medium. 
Standard protein purification techniques can be used to isolate the protein of interest from the 

15 medium or from the harvested cells. 

C Production of Proteins in Vivo 

The invention also provides for large scale production of a protein of interest in 
animals, such as in transgenic farm animals. Advances in transgenic technology have made it 

20 possible to produce transgenic livestock, such as cattle, goats, pigs and sheep (reviewed in 
Wall, RJ. et al. (1992) J. Cell Biochem. 42:1 13-120; and Clark, A.J. et al. (1987) Trends in 
Biotechnology 5:20-24). Accordingly, transgenic livestock carrying in their genome the 
components of the inducible regulatory system of the invention can be constructed, wherein a 
gene encoding a protein of interest is operatively linked to at least one tet operator sequence. 

25 Gene expression, and thus protein production, is induced by administering Tc (or analogue 
thereof) to the transgenic animal. Protein production can be targeted to a particular tissue by 
linking the nucleic acid encoding the transactivator fusion protein to an appropriate tissue- 
specific regulatory element(s) which limits expression of the transactivator to certain cells. 
For example, a mammary gland-specific regulatory element, such as the milk whey promoter 

30 (U.S. Patent No. 4,873,3 16 and European Application Publication No. 264,166), can be 
linked to the transactivator transgene to limit expression of the transactivator to mammary 
tissue. Thus, in the presence of Tc (or analogue), the protein of interest will be produced in 
the mammary tissue of the transgenic animal. The protein can be designed to be secreted into 
the milk of the transgenic animal, and if desired, the protein can then be isolated from the 

35 milk. 



D. Animal Models of Human Disease 

The transcriptional activator and inhibitor proteins of the invention can be used alone 
or in combination to stimulate or inhibit expression of specific genes in animals to mimic the 




pathophysiology of human disease to thereby create animal models of human disease. For 
example, in a host animal, a gene of interest thought to be involved in a disease can be placed 
under the transcriptional control of one or more tet operator sequences (e.g., by homologous 
recombination, as described herein). Such an animal can be mated to a second animal 
5 carrying one or more transgenes for a transactivator fusion protein and/or an inhibitor fusion 
protein to create progeny that carry both a tetracycline-regulated fusion protein(s) gene and a 
tet-regulated target sequence. Expression of the gene of interest in these progeny can be 
modulated using tetracycline (or analogue). For example, expression of the gene of interest 
can be downmodulated using a transcriptional inhibitor fusion protein to examine the 
10 relationship between gene expression and the disease. Such an approach may be 

advantageous over gene "knock out" by homologous recombination to create animal models 
of disease, since the tet-regulated system described herein allows for control over both the 
levels of expression of the gene of interest and the timing of when gene expression is down- 
or up-regulated. 

15 

E. Production of Stable Cell Lines for Gene Cloning and Other Uses 

The transcriptional inhibitor system described herein can be used keep gene 
expression "off 1 (i.e., expressed) to thereby allow production of stable cell lines that 
otherwise may not be produced. For example, stable cell lines carrying genes that are 

20 cytotoxic to the cells can be difficult or impossible to create due to "leakiness" in the 

expression of the toxic genes. By repressing gene expression of such toxic genes using the 
transcriptional inhibitor fusion proteins of the invention, stable cell lines carrying toxic genes 
may be created. Such stable cell lines can then be used to clone such toxic genes (e.g., 
inducing the expression of the toxic genes under controlled conditions using Tc or analog). 

25 General methods for expression cloning of genes, to which the transcriptional inhibitor 
system of the invention can be applied, are known in the art (see e.g., Edwards, C. P. and 
Aruffo, A. (1993) Curr. Opin. Biotech. 4:558-563) Moreover, the transcriptional inhibitor 
system can be applied to inhibit basal expression of genes in other cells to create stable cell 
lines, such as in embryonic stem (ES) cells. Residual expression of certain genes introduced 

30 into ES stems may result in an inability to isolate stably transfected clones. Inhibition of 

transcription of such genes using the transcriptional inhibitor system described herein may be 
useful in overcoming this problem. 

Advantages 

35 The inducible regulatory system of the invention utilizing a transactivator fusion 

protein addresses and overcomes many of the limitations of other inducible regulatory 
systems in the art. For example, very high intracellular concentrations of the transcriptional 
activator fusion protein of the invention are not required for efficient regulation of gene 
expression. Additionally, since gene expression is induced by adding rather than removing 




the inducing agent, the induction kinetics in the system of the invention are not limited by the 
rate of removal of the inducing agent and thus are typically faster. Moreover, the inducing 
agent is only present when gene transcription is induced, thereby avoiding the need for the 
continuous presence of an agent to keep gene expression off. 
5 Use of the transcriptional inhibitor fusion proteins of the invention to inhibit 

transcription in eukaryotic cells also provide advantages over the use of prokaryotic 
repressors alone (e.g., TetR, lacR) to inhibit transcription in eukaryotic cells. Since the 
inhibitor fusion proteins of the invention contain a eukaryotic transcriptional silencer domain, 
these fusion proteins should be more efficient at repressing transcription in eukaryotic cells, 
10 and thus may potentially require lower intracellular concentrations for efficient repression 
with less liklihood of "leakiness". Additionally, by insertion of tetO sequences into the 
regulatory region of an endogenous gene, the transcriptional inhibitor fusion proteins of the 
invention can be used to down-regulate constitutive and/or tissue-specific expression of 
endogenous genes. 

15 Furthermore, in contrast to various versions of the lac system (e.g., Labow et al. 

(1990) Mol Cell Biol JJ}:3343-3356; Bairn et al. (1991) Proc. Natl Acad. Sci. USA 
&&:5072-5076), which are limited by the negative properties of the inducing agent (IPTG) 
and/or by the need to increase the temperature in order to induce gene expression (which may 
elicit pleiotropic effects), the inducing agent used in the system of the invention (Tc or an 

20 analogue thereof) has many advantageous properties: 1) Tc and analogues thereof exhibit 
high affinity for TetR and low toxicity for eukaryotic cells, and thus can be used for gene 
induction at concentrations that do not affect cell growth or morphology; 2) Tc analogues 
which retain TetR binding but which have reduced antibiotic activity exist and can be used as 
inducing agents, thereby avoiding possible side effects from the antibiotic property of Tc; 3) 

25 the pharmacokinetic properties of Tc and Tc analogues enable rapid and efficient cellular 
uptake and penetration of physiological barriers, such as the placenta or the blood-brain 
barrier; and 4) Tc analogues with different induction capabilities permit modulation of the 
level of gene expression. 

Thus, the invention provides an inducible regulatory system which allows for rapid 

30 activation of gene transcription without cellular toxicity and a range of induction indices. 

The increase in gene expression upon induction typically is between 1000- and 2000-fold and 
can be as high as about 20,000-fold. Alternatively, lower levels of gene induction, e.g., 10- 
fold, can be achieved depending upon which inducing agent is used. This system can be 
utilized in a wide range of applications. These applications include gene therapy, large-scale 

35 production of proteins in cultured cells or in transgenic farm animals, and the study of gene 
function, for example in relationship to cellular development and differentiation. Moreover, 
the novel transcription units of the invention allow for coordinate or independent regulation 
of the expression of multiple genes utilizing the regulatory components of the invention. 




-53- 



This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 



A "reverse" Tet repressor, which binds to its target DNA in the presence rather than 
the absence of tetracycline, was generated by chemical mutagenesis and selection essentially 

10 as described in Hecht, B. et al. (1993) J, Bacteriology 12^:1206-1210. Single-stranded DNA 
(coding and non-coding strands) encoding the wild-type TnlO-derived Tet repressor was 
chemically mutagenized with sodium nitrite. Single-stranded DNAs (40 jag in 40 jal in Tris- 
EDTA buffer) were mixed with 10 jil of 2.5 M sodium acetate (pH 4.3) and 50 |il of sodium 
nitrate ranging between 0.25 M and 2 M and incubated for 45 to 60 minutes at room 

1 5 temperature. After mutagenesis, the complementary strand was synthesized using reverse 
transcriptase or by amplification using the polymerase chain reaction with Taq DNA 
polymerase. Since the mutagenesis procedure yields multiple mutations in the DNA, three 
fragments of the gene, of about 200 base pairs each, were individually subcloned into a wt 
Tet repressor gene in a recombinant expression vector to replace the corresponding portion of 

20 the wild-type gene. This created a pool of mutated Tet repressor genes wherein each gene 
had mostly single mutations in the 200 base pair mutagenized fragment of the gene. 

The pool of mutated Tet repressors were screened in a genetic assay which positively 
selects for a functional interaction between a Tet repressor and its cognate operator using E. 
coll strain WH207(A,WH25) (the construction of this strain is described in detail in 

25 Wissmann, A. et al. (1991) Genetics 12&:225-232). In this E. coli strain, tet operators direct 
the expression of divergently arranged P-galactoside (lacZ) and Lac repressor (lacl) genes 
and the lac regulatory region directs the expression of a galactokinase igalK) gene. Binding 
of Tet repressors to tet operators turns off transcription of the lacl and lacZ genes. The 
absence of Lac repressor allows for expression of the galK gene, which enables the E. coli 

30 strain to use galactose as a sole carbon source, which serves as one marker. The lacZ~ 

phenotype serves as a second marker. Thus, bacteria containing Tet repressors which bind to 
tet operators have a Gal + , lacZ~ phenotype. Bacteria containing wild-type Tet repressors 
have a Gal + , lacZ' phenotype in the absence of tetracycline. A mutated "reverse" Tet 
repressor (rTetR) was selected based upon a Gal + , lacZ' phenotype in the presence of 

35 tetracycline. 

The nucleotide and amino acid sequence of the rTetR mutant are shown in SEQ ID 
NOs: 1 (nucleotide positions 1-621) and 2 (amino acid positions 1-207), respectively. 
Sequence analysis of the rTetR mutant showed the following amino acid and nucleotide 
changes: 



5 



EXAMP LE 1: 



Selection of a Mutated Tet Repressor and Construction 
of a Tetracycline Inducible Transcriptional Activator 



aa (position) 



wild-type 



mutant 



affected codon 
wild-type mytent 



glu(71) 
asp (95) 
leu (101) 
gly(102) 



lys 
asn 
ser 
asp 



GAA 
GAT 
TTA 
GGT 



AAA 
AAT 
TCA 
GAT 



Two additional mutations did not result in an amino acid exchange: 



aa (position) 
wild-type mutfflt 



affected codon 
wild-type mutant 



leu (41) 
arg (80) 



leu 
arg 



TTG 
CGT 



CTG 
CGC 



To convert the rTetR mutant to a transcriptional activator, a 399 base pair 
XbaI/Eco47III fragment encoding amino acids 3 to 135 of rTetR (i.e., encompassing the 

1 0 mutated region) was exchanged for the corresponding restriction fragment of the expression 
vector pUHD15-l to create pUHD17-l. In pUHD15-l, nucleotide sequences encoding wild- 
type TetR are linked in frame to nucleotide sequences encoding the C-terminal 130 amino 
acids of herpes simplex virus VP 16. These transactivator sequences are flanked upstream by 
a CMV promoter/enhancer and downstream by an SV40 poly(A) site (the construction of 

15 pUHD15-l is described in more detail in USSN 08/076,726 and Gossen, M. and Bujard, H. 
(1992) Proa Natl. Acad ScL USA £2:5547-5551). Thus, in pUHD17-l, nucleotide 
sequences encoding the reverse TetR mutant are linked in frame to VP 16 sequences to create 
a reverse Tc-controlled transactivator (referred to herein as tTA R ). The analogous exchange 
of the mutated region of rTetR for the wild-type region of TetR was performed with plasmid 

20 pUHD 152-1, which is the same as pUHD15-l except that it additionally contains nucleotide 
sequences encoding a nuclear localization signal linked in-frame to the 5 1 end of the 
nucleotide sequences encoding the Tet repressor. The amino acid sequence of the nuclear 
localization signal is MPKRPRP (SEQ ID NO: 5), which is linked to the serine at amino acid 
position 2 of TetR. The resulting expression vector encoding the reverse Tc-controlled 

25 transactivator including a nuclear localization signal (referred to herein as ntTA R ) was named 
pUHD 172-1. 




EXAMPLE 2 : Tetracycline-Induced Stimulation of Transcription by tTA R 

Transient Transfection 

The pUHD17-l and pUHD 172-1 expression vectors were transiently transfected by a 
5 standard calcium phosphate method into HeLa cells together with a reporter plasmid, 
pUHC13-3, in which heptameric tet operators are fused upstream of a minimal 
hCMV-promoter and a luciferase reporter gene (the reporter plasmid is described in detail in 
USSN 08/076,726 and Gossen, M. and Bujard, H. (1992) Proc, Natl Acad. Sci. USA 
52:5547-5551). After incubation of the transfected cells at 37 °C for 20 hours in the presence 

10 or absence of tetracyline (or an analogue thereof), luciferase activity was assayed as follows: 
Cells grown to -80 % confluency in 35 mm dishes in Eagle's minimal essential medium were 
washed with 2 ml of phosphate-buffered saline before they were lysed in 25 mM Tris 
phosphate, pH 7.8/2 mM dithiothreitol/2 mM diaminocyclohexanetetraacetic acid/10 % 
glycerol/1 % Triton-X-100 for 10 minutes at room temperature. The lysate was scraped off 

15 the culture dishes and centrifiiged for 10 seconds in an Eppendorf centrifuge. Next, aliquots 
(10 jil) of the supernatant were mixed with 250 \i\ of 25 mM glycylglycine/15 mM MgS04/5 
mM ATP and assayed for luciferase activity in a Lumat LB9501 (Berthold, Wildbad, F.R.G.) 
using the integral mode (10 seconds). D-Luciferin (L6882, Sigma) was used at 0.5 mM. The 
background signal measured in extracts of HeLa cells that did not contain a luciferase gene 

20 was indistinguishable from the instrument background (80-120 relative light units (rlu)/10 
sec). Protein content of the lysate was determined according to Bradford (Bradford, M.M. 
(1976) Anal Biochem. 22:248-254). Cells transfected with plasmids encoding either tTA R or 
ntTA R showed an increased level of luciferase activity in the presence of tetracyclines. This 
effect was consistently more pronounced when anhydrotetracycline (ATc) was used instead 

25 of tetracycline. 

Stable Transfection 

After this transient transfection analysis, expression vectors were prepared for stable 
transfection of cells. A pSV2neo-derived neomycin resistance cassette (described in 

30 Southern, P.J. and Berg, P. (1982) J. Mol Appl Genet 1:327-341) was integrated into the 
transactivator expression vectors pUHD17-l and pUHD 172-1, resulting in pUHD17-lneo 
and pUHD172-lneo, respectively. pUHD172-lneo, coding for ntTA R , was stably integrated 
into HeLa cells by standard techniques. Ten G41 8 -resistant cell clones were analyzed for 
their phenotype by transient supertransfection with pUHC13-3 carrying the luciferase gene 

35 under the control of a minimal CMV promoter and tet operators. Three clones, HR4, HR5 

and HR10, showed a strong increase of luciferase activity in the presence of ATc. From these 
clones, HR5 was selected for further experiments. 

To create stable transfectants for both ntTA R and a tet operator-linked luciferase 
reporter gene, HR5 cells were cotransfected with pUH13-3 and pHMR272, which encodes 




for hygromycin resistance (see Bernhard, H-U. et al. (1985) Exp. Cell Res. 15&:237-243), and 
hygromycin resistant clones were selected. In an analogous experiment, HR5 cells were 
cotransfected with pUH13-7 and pHMR272. pUH13-7 contains a minimal promoter 
sequence spanning position +19 to -37 of the HSVtk promoter adjacent to the heptameric 
5 tetO sequences, rather than a minimal CMV promoter. From 21 hygromycin resistant clones, 
10 showed inducible luciferase activity upon addition of Tc or doxycyline (Dc) to the culture 
medium. Clones containing the luciferase reporter gene linked to a minimal CMV promoter 
are referred to as HR5-C, whereas those containing the luciferase reporter gene linked to a 
minimal tk promoter are referred to HR5-T. 

10 Six of the HR5 clones stably transfected with a ntTAR-dependent reporter unit and 

previously shown to be responsive to tetracyclines were grown in parallel in the absence or 
presence of 1 jag/ml doxycycline. About 3 x 10 4 cells were plated in each 35 mm dish (4 
dishes for each clone). After growth for 60 hours, cells were harvested and the luciferase 
activity of the extracts (in relative light units (rlu)/jag extracted protein) was determined. As 

1 5 shown in Table 1 , the absolute expression levels of six clones demonstrate that activation of 
luciferase gene expression over 3 orders of magnitude is achieved in several of the double 
stable cell lines containing the ntTA R regulatory system. 

It should be noted that even higher induction factors (e.g., as high as a 20,000-fold 
increase in expression) could be achieved if, instead of simply adding the inducing agent to 

20 the culture medium, the cells were washed prior to induction and then replated in fresh 
culture medium containing the inducing agent. 

TABLE 1 . Doxvcvcline-dependent luciferase a ctivity of double stable luc+/HR5 cell clones 

Luciferase Activity, rlu/|4,g protein 



25 


Clone 


-Doxvcvcline 


+Doxvcvcline 


Induction Factor 




HR5-C6 


65 


54,911 


845 






62 


69,525 


1120 




HR5-C11 


100 


165,671 


1660 


30 




142 


179,651 


1270 




HR5-C14 


43 


44,493 


1030 






43 


56,274 


1310 


35 


HR5-T2 


56 


16,696 


298 






40 


16,416 


410 




HR5-T15 


6.8 


1838 


270 






6.5 


1688 


260 


40 












HR5-T19 


4.8 


1135 


236 






5.4 


1285 


237 



57- 



Induction of luciferase activi ty hv different tetracyclines 

The ability of tetracycline and several different tetracycline analogues to induce 
luciferase expression in HR5-C1 1 cells was examined. HR5-C1 1 cells plated at a density of 
5 about 3 x 10^ cells/35 mm dish (-80% confluency). After full attachment of the cells, the 
following tetracyclines ere added to the cultures at a concentration of 1 jag/ml: tetracycline- 
HC1 (Tc), oxytetracycline-HCl (OTc), chlorotetracycline (CTc), anhydrotetracycline-HCl 
(ATc) and doxycycline-HCl (Doxy). These compounds are commercially available from 
Sigma Chemical Co., St. Louis, MO, and were kept in aqueous solution at a concentration of 

10 1 jag/ml. Cells grown in the absence of antibiotic (-) served as a control. After 3 days, the 
cells were harvested and the luciferase activity and the protein content of the extracts were 
determined. The results are shown in the bar graph of Figure 1 . Each bar in the figure 
(closed and hatched) represents the relative luciferase activity (normalized toward the amount 
of extracted protein) of a single culture dish. The mean of the luciferase activities obtained 

15 from the two plates grown without tetracyclines was defined as 1. Tc, CTc and OTc showed 
modest stimulation of luciferase activity. By contrast, ATc and Doxy stimulated luciferase 
activity approximately 1000 and 1500 fold, respectively. 

Dose-response o f luciferase activity to doxvcvcline in HR5-C1 1 cells 
20 The above-described experiment examining the induction ability of different 

tetracyclines revealed that doxycycline was the most potent effector of the tetracyclines 
examined. Doxycycline was therefore selected to quantitatively analyze its dose-response. 
HR5-C1 1 cells were incubated with different concentrations of doxycycline and luciferase 
activity was measured. The data of three independent experiments are shown in Figure 2. At 
25 less than 10 ng/ml in the culture medium, doxycycline is ineffective at inducing luciferase 
activity. However, when the concentration was raised above 10 ng/ml, an almost linear 
increase in expression of luciferase was observed. Maximal activation was achieved at 1 
jig/ml. At concentrations above 3 p.g/ml, doxycycline showed a slight growth-inhibitory 
effect on HeLa cells as determined in a MTT-assay. 

30 

Kinetics of induction of the ntTA R system 

To examine the kinetics of doxycycline-induced ntTA^mediated induction of gene 
expression, the time course of induction of luciferase activity in HR5-C1 1 cells was 
monitored after addition of doxycycline to the medium (final concentration 1 jag/ml). Cells 
35 were cultured in the presence of doxycycline and after various time intervals, the cells were 
harvested and luciferase activity was determined as described above. As shown in Figure 3, a 
100-fold induction of luciferase activity was observed after 5.5 hours incubation with Doxy. 
Fully induced levels were achieved in less than 24 hours of incubation with Doxy. Thus, 




these results indicate that induction of gene expression occurs rapidly following exposure of 
the cells to the inducing agent. 

EXAMPLE 3: Coordinate Regulation of the Expression of Two Nucleotide 

5 Sequences by a Tc-Controlled Transcriptional Activator 

A recombinant expression vector for coordinate, bidirectional transcription of two 
nucleotide sequences was constructed comprising, in a 5 1 to 3' direction: a luciferase gene, a 
first minimal promoter, seven tet operator sequences, a second minimal promoter and a LacZ 

10 gene. The construct is illustrated in Figure 6. In this construct, the luciferase and LacZ genes 
are oriented such that they are transcribed in opposite orientations relative to the tet operator 
sequences, i.e., the luciferase gene is transcribed in a 5' to 3* direction from the bottom strand 
of DNA, whereas the LacZ gene is transcribed in a 5' to 3* direction from the top strand of 
DNA. The luciferase gene is followed by an SV40 polyadenylation signal, whereas the LacZ 

15 gene is followed by a p-globin polyadenylation signal. 

The construct was transfected into the HeLa cell line HtTA-1 cells, which express a 
wild-type Tet repressor-VP16 fusion protein (referred to as tTA and described in Gossen, M. 
and Bujard, H. (1992) Proc. Natl Acad. Set USA £2:5547-5551). The tTA fusion protein 
binds to tet operator sequences in the absence of Tc (or analogue) but not in the presence of 

20 Tc (or analogue). The construct was cotransfected into HtTA-1 cells with a plasmid which 
confers hygromycin resistance and stably transfected clones were selected based upon their 
hygromycin resistant phenotype. Selected hygromycin resistant (Hygr 1 ) clones were 
examined for luciferase and p-galactosidase activity. Clones positive for all three markers 
(Hygr r , luc" 1 ", (3-gal + ) were then examined for tetracycline-dependent coregulation of 

25 expression of luciferase and p-galactosidase activity by cultureing the clones in increasing 

amounts of tetracycline and measuring luciferase and p-galactosidase activity. The results of 
such an experiment using clone Htl 3 16-8/50 are shown in Figure 8. In the absence of 
tetracycline (in which case tTA can bind to tet operators and activate gene expression), both 
luciferase and p-galactosidase activity is detected. In the presence of increasing amounts of 

30 tetracycline, luciferase and p-galactosidase activity are coordinately and equivalently 

downregulated. This data demonstrates that expression of two genes can be coordinately 
regulated by a tetracycline-controlled transactivator by operatively linking the two genes to 
the same tet operator sequence(s). 

35 EXAMPLE 4 : Construction of a Tet racy cline-ReguIated Transcriptional Inhibitor 

Fusion Protein Comprising TetR and a Krueppel Silencer Domain 

To contruct an expression vector encoding a tetracycline-regulated transcriptional 
inhibitor of the invention (also referred to as a tetacycline controlled silencer domain, or 




tSD), a nucleic acid fragment encoding a transcriptional silencer domain is ligated into an 
expression vector containing nucleotide sequences encoding a wild-type or modified (i.e., 
mutated) TetR such that the silencer domain coding sequences are ligated in-frame with the 
TetR coding sequences. The plasmid pUHD141sma-l contains nucleotide sequences 
5 encoding a wild-type TnlO-derived Tet repressor (the nucleotide and amino acid sequences of 
which are shown in SEQ ID NOs: 16 and 17, respectively). In pUHD141sma-l, the TetR 
coding sequence is linked at its 5 f end to a CMV promoter and at its immediate 3' end to a 
nucleotide sequence that creates a polylinker into which additional nucleic acid fragments can 
be introduced. The nucleotide sequence across this polylinker region is: TCC CCG GGT 

1 0 AAC TAA GTA AGG ATC C (SEQ ID NO: 24) (wherein TCC CCG GGT ACC encode 
amino acid residues 205-208 of TetR, namely Ser-Gly-Ser-Asn). This polylinker region 
includes restriction endonuclease sites for PspAI (CCC GGG) and BamHI (GGA TCC). 
Downstream of the polylinker region, the plasmid contains an SV40-derived polyadenylation 
signal. The pUHD141sma-l vector is illustrated schematically in Figure 11. 

15 To construct an expression vector encoding a fusion protein between TetR and a 

transcriptional silencer domain from the Drosophila Krueppel (Kr) protein, a nucleic acid 
fragment encoding a silencer domain from Kr is amplified by the polymerase chain reaction 
(PCR) using Kr cDNA as a template. Oligonucleotide primers are designed which amplify a 
nucleic acid fragment encoding the C-terminal 64 amino acids of Kr (referred to as C64KR). 

20 This region corresponds to amino acid positions 403-466 of the native protein. The 

nucleotide and amino acid sequences of C64KR are shown in SEQ ID NO: 20 and SEQ ID 
NO: 21, respectively. PCR primers are designed to include' restriction endonuclease sites 
such that the resultant amplified fragment contains restriction endonuclease sites at its 5' and 
3' ends. Restriction endonuclease sites are chosen that are contained within the polylinker of 

25 pUHD141sma-l which allow in-frame, directional ligation of the amplified fragment into the 
polylinker site. For example, PCR primers are designed which incorporate a PspAI site 
(CCC GGG) at the 5' end of the fragment encoding C64KR and a BamHI site at the 3' end of 
the fragment. After a standard PCR reaction, the amplified fragment and pUHD141-smal are 
digested with PspAI and BamHI. The amplified fragment is then ligated directionally into 

30 the polylinker site of pUHD141-smal using standard ligation conditions to create the 

expression vector pUHD141kr-l . Standard techniques are used to isolate the desired plasmid 
and confirm its construction. Construction of pUHD141kr-l is illustrated schematically in 
Figure 1 1 . 

The resultant pUHD141kr-l expression vector contains nucleotide sequences 
35 encoding a fusion protein comprising amino acids 1-207 of the wild type TetR linked in- 
frame to amino acids 403-466 of Kr (C64KR). The nucleotide and amino acid sequences 
across the junction of the fusion protein are as follows: AGT GGG TCC CCG GGT GAC 
ATG GAA (SEQ ID NO: 25) and Ser-Gly-Ser-Pro-Gly-Asp-Met-Glu (SEQ ID NO: 26). Ser- 



-60- 



Gly-Ser corresponds to amino acids 205-207 of TetR, Pro-Gly are encoded by the polylinker 
and Asp-Met-Glu correspond to the amino acids 403-405 of C64KR. 

Similarly, an expression vector encoding a fusion protein of a mutated TetR (that 
binds to tetO only in the presence of Tc) and C64KR can be constructed as described above 
5 using nucleotide sequences encoding a mutant TetR (the nucleotide and amino acid 

sequences of which are shown in SEQ ID NOs: 18 and 19, respectively) in place of the wild 
type TetR sequences in pUHD141-smaL 

An expression vector encoding a TetR-Kr fusion protein (e.g., pUHD141kr-l) is 
transiently or stably transfected into host cells as described in Example 2 to express the TetR- 

10 Kr fusion protein in the host cell. A reporter gene construct containing one or more tetO 

sequences, a minimal promoter and a reporter gene, such as luciferase, is also transfected into 
the cells as described in Example 2. (Reporter gene constructs are described in further detail 
in USSN 08/076,726 and Gossen, M. and Bujard, H. (1992) Proc. Natl Acad Sci. USA 
£2:5547-555 1). Luciferase activity in the presence and absence of increasing concentrations 

15 of Tc or an analogue, e.g., doxycycline, is measured as described in Example 2. For the wild 
type TetR-Kr fusion protein described above, the transcriptional inhibiting ability of the 
fusion protein is determined by comparing the amount of luciferase activity in the presence of 
doxycycline (no repression) to the amount of luciferase activity in the absence of doxycycline 
(repression). The transcriptional inhibiting activity of the fusion protein can also be tested 

20 using reporter gene constructs that exhibit higher basal levels of expression (i.e., higher levels 
of expression in the presence of doxycycline) by using a reporter gene construct that contains 
additional positive regulatory elements (e.g., enhancer sequences). 

EXAMPLE 5 : Construction of a Tetracycline-Regulated Transcriptional Inhibitor 
25 Fusion Protein Comprising TetR and a v-erbA Silencer Domain 

To construct an expression vector encoding a fusion protein between TetR and a 
transcriptional silencer domain from the v-erbA oncogene product, a nucleic acid fiagment 
encoding a silencer domain from v-erbA is ligated in-frame into pUHD141sma-l as 

30 described in Example 4. A nucleic acid fragment encoding a v-erbA silencer domain suitable 
for ligation into pUHD141sma-l is amplified by the polymerase chain reaction (PCR) using a 
v-erbA cDNA as a template. Oligonucleotide primers are designed which amplify a nucleic 
acid fragment encoding amino acids 364-635 of the native v-erbA protein. The nucleotide 
and amino acid sequences of this region of v-erbA are shown in SEQ ID NO: 22 and SEQ ID 

35 NO: 23, respectively. As described in Example 4, PCR primers are designed such that the 

amplified v-erbA fragment contains restriction endonuclease sites at its 5* and 3 1 endi;, such as 
PspAI at the 5' end and BamHI at the 3 f end. After a standard PCR reaction, the amplified 
fragment and pUHD141-smal are digested with PspAI and BamHI. The amplified fragment 
is then ligated directionally into the polylinker site of pUHD141-smal using standard ligation 




conditions to create the expression vector pUHD141kr-l . Standard techniques can be used to 
isolate the desired plasmid and confirm its construction. Construction of pUHD141erb-l is 
illustrated schematically in Figure 11. 

The resultant pUHD141erb-l expression vector contains nucleotide sequences 
5 encoding a fusion protein comprising a wild type TetR linked in-frame to amino acids 364- 
635 of v-erbA. The nucleotide and amino acid sequences across the junction of the fusion 
protein are as follows: AGT GGG TCC CCG GGT CTG GAC GAC (SEQ ID NO: 27) and 
Ser-Gly-Ser-Pro-Gly-Leu-Asp-Asp (SEQ ID NO: 28). Ser-Gly-Ser corresponds to amino 
acids 205-207 of TetR, Pro-Gly are encoded by the polylinker and Leu- Asp- Asp correspond 
10 to amino acids 364-366 of the v-erbA silencer domain. 

As described in Example 4, an expression vector encoding a fusion protein of a 
mutated TetR (that binds to tetO only in the presence of Tc) and a v-erbA silencer domain 
can be constructed as described above using nucleotide sequences encoding a mutant TetR 
(the nucleotide and amino acid sequences of which are shown in SEQ ID NOs: 18 and 19, 
1 5 respectively) in place of the wild type TetR sequences in pUHD141 -smal . 

Expression of the TctR-v-erbA fusion protein in host cells and assaying of the 
transcriptional inhibiting activity of the fusion protein is as described in Example 4 for the 
TetR-Kr fusion protein. 

20 EXAMPLE 6: Regulation of Gene Expression in Transgenic Animals by tTA R 

To examine the ability of tTA R to regulate gene expression in vivo, transgenic strains 
of mice were constructed which contained heterologous chromosomal insertions of either a 
tTA R expression construct or a reporter gene operably linked to tet operators. Single 

25 transgenic strains containing either a tTA R expression construct or the tefO-linked reporter 
gene were then cross bred and double transgenic progeny were identified. The double 
transgenic animals were then characterized as to the ability of tTA R , in a tetracycline 
dependent manner, to regulate expression of the reporter gene. This example demonstrates 
that tTA R effectively stimulates the expression of a gene operably linked to tet operators in 

30 tissues of the animals in vivo upon administration of tetracycline (or analogue) to the animals, 
whereas expression of the tefO-linked gene remains at background levels in the absence of 
tetracycline or an analogue. These results demonstrate that the tetracycline-controlled 
transcriptional regulatory system described herein functions effectively in animals, in 
addition to cell lines in vitro. 

35 

Generation of m ice transeen- for a P hCMV-tTA R expression unit 

Mice expressing tTA protein were obtained by pronuclear injection into fertilized 
oocytes of a 2.7kb Xhol-Pfml fragment excised from plasmid pUHG17-l. This DNA 
fragment contained the tTA R gene (shown in SEQ ID NO: 1) under the transcriptional control 




of the human CMV IE promoter (position +75 to -675) together with a rabbit p-globin 
polyadenylation site including an intron. The human CMV IE promoter is a constitutive 
promoter that allows expression of the mutated tetR-VP16 fusion protein in all cells where 
chromosomal integration of the DNA sequence encoding tTA R has occurred. DNA was 
5 injected into fertilized oocytes at a concentration of approximately 5 ng per p,l by standard 
techniques. Transgenic mice were generated from the injected fertilized oocytes according to 
standard procedures. Transgenic founder mice were analyzed using polymerase chain 
reaction (PCR) and Southern hybridization to detect the presence of the tTA R transgene in 
chromosomal DNA of the mice. Two transgenic mouse lines, CR3 and CR4 were identified 
10 and crossbred with another transgenic mouse line carrying a luciferase reporter gene under 
the control of tetO sequences, (described further below). 

Generation of mice transgenic for the PhCMV*-! luciferase reporter unit 

Mice carrying a PhCMV* -1 l uc reporter gene expression unit were generated by 

15 pronuclear injection into fertilized oocytes of a 3.1 kb Xhol-Eael fragment excised from 

plasmid pUHC13-3. This DNA-fragment contains the luciferase gene under transcriptional 
control of the tetracycline-responsive PhCMV*-l promoter (SEQ ID NO: 8), together with an 
SV40 t early polyadenylation site including an intron. DNA was injected into oocytes at a 
concentration of approximately 5 ng per jj.1 and transgenic mice were generated according to 

20 standard procedures. Transgenic founder mice were analyzed using Southern hybridization 
to detect the presence of the PhCMV*-l luc transgene in chromosomal DNA of the mice. A 
mouse line transgenic for the /e/O-linked luciferase reporter gene, L7, was crossbred with the 
tTA R transgenic lines CR3 and CR4 (described further below). 

25 Generation of mice transgenic for the PhCMV*-! luc and PhCMV- tlA R 

Having constructed single transgenic mice expressing tTA R or carrying PhCMV*- 1 
luc, double transgenic mice carrying both the tTA expression vector and the luciferase 
reporter-units were obtained through cross breeding of heterozygous mice transgenic for one 
of the two transgenes. Double transgenic animals were identified by standard screenings 

30 (e.g., PCR and/or Southern hybridization) to detect the presence of both the tTA R transgene 
and the PhCMV*- 1 luc transgene in chromosomal DNA of the mice. 

Induction and analysis of luciferase activity in tissue samples from mice 

For oral administration, tetracycline or its derivative doxycycline were given in the 
35 drinking water at a concentration of 200 ng per ml with 5 % sucrose to hide the bitter taste of 

the antibiotics. For lactating mice, the concentration was 2 mg per ml with 10 % sucrose to 

ensure a sufficient uptake via the milk by the young. 

To analyze luciferase activity, mice were killed by cervical dislocation and tissue 

samples were homogenized in 2 ml tubes containing 500 |il lysis-buffer (25 mM Tris 




phosphate, pH 7.8/2 mM DTT/ 2 mM EDTA/ 10 % glycerol/ 1 % Triton X100) using a 
Ultra-Turrax. The homogenate was frozen in liquid nitrogen and centrifuged after thawing 
for 5 min at 15,000g. 2-20 jal of the supernatant were mixed with 250 jal luciferase assay 
buffer (25 mM glycylglycine, pH 7.5/ 15 mM MgS04/ 5 mM ATP) and luciferase activity 
5 was measured for 10 sec after the injection of IOOjj.1 of a 125 ^iM luciferin solution using 
Berthold Lumat LB 9501 . The protein concentration of the homogenate was determined 
using Bradford assay and luciferase activity was calculated as relative light units (rlu) per (ig 
of total protein. 

10 Results 

Mice from 2 lines carrying the PhCMV _tTAR transgene (CR3 and CR4) were mated 
with mice from line L7, transgenic for PhCMV*-l luc - The L7 line shows a very low but 
detectable background of luciferase activity in different organs that is probably due to 
position effects at the integration side. The background luciferase activity in different tissues 

1 5 of the L7 single transgenic mice is illustrated graphically in Figure 12, represented by the 

checked columns on the right-hand side for each tissue examined (each column represents the 
results from one animals). The luciferase activity in different tissues of the C3/L7 double 
transgenic mice in the absence of the tetracycline analogue doxycycline (i.e., uninduced 
conditions) is illustrated graphically in Figure 12, represented by the dark columns in the 

20 middle for each tissue examined. The luciferase activity in different tissues of the C3/L7 

double transgenic mice in the presence of doxycycline (i.e., induced conditions) is illustrated 
graphically in Figure 12, represented by the light columns on the left-hand side for each 
tissue examined. 

Luciferase activity was detectable in the seven tissues of the double transgenic mice 
25 examined: pancreas, kidney, stomach, muscle, thymus, heart and tongue. The tissue pattern 
of activated luciferase levels (i.e., in the presence of doxycycline) in the double transgenic 
mice was similar to expression patterns of the hCMV IE promoter reported in the literature. 
This is consistent with expression of the luciferase reporter gene being regulated by tTA R 
(which is expressed in the mice under the control of the hCMV IE promoter). The level of 
30 reporter gene induction varied among the different tissues examined. Regulation factors up 
to 100,000 fold (i.e., 5 orders of magnitude) were achieved, e.g. in the pancreas. 

EXAMPLE 7: Combinatorial Regulatory Schemes using 

Tetracycline-Regulated Fusion Proteins 

35 

In this example, the ability to use the various tetracycline-regulated transcriptional 
activators and inhibitors described herein in combination, e.g., to regulate expression of two 
gene expression in a single cell using two different transactivators or one gene in a cell using 
both a transactivator and an inhibitor, was investigated further. In a first series of 




experiments, various Tet repressors having mutations within the DNA binding domain of the 
protein were examined for their ability to bind either a wild type tetO sequence or various 
tetO variants to select for Tet repressors having specificity for variant tetO sequences. In a 
second series of experiments, Tet repressors having specificity for different variant tetO 
5 sequences were converted into either a tTA (which binds to its target tetO in the absence of 
Tc) or a tTA R (which binds to its target tetO in the presence of Tc) and used to regulate gene 
expression in host cells. In a third series of experiments (described in Part C below), various 
combinations of different classes of Tet repressors were examined for their ability to 
heterodimerize to identify classes of Tet repressors which do not heterodimerize. Finally in 
10 Part D, below, the results from the three preceding analyses are combined to present preferred 
model systems for 1) regulating two /e/O-linked genes in one cell using a tTA and a tTA R 
having different variant tetO specificities and 2) regulating the expression of one te/O-linked 
gene using both a Tc-regulated transcriptional activator and a Tc-regulated transcriptional 
inhibitor. 

15 

A. Tet Operator Binding Specificity of Tet Repressors Mutated within the 
PNA Binding D omain 

A series of amino acid substitutions were made in the TnlO Tet repressor within a 

20 region of the protein constituting the DNA binding domain (DBD) (i.e., the region of the 
protein which is thought to interact with a target tetO sequence). Amino acid substitutions 
were introduced into TetR by standard mutagenesis of a DNA molecule encoding the TetR. 
The ability of a DBD-mutated TetR to bind either a wild-type B class tetO or mutated 
variants thereof was then analyzed by introducing an expression vector encoding the DBD- 

25 mutated TetR into E. coli, together with a reporter gene comprising a P-galactosidase gene 
operatively linked to either a wild-type tetO sequence (referred to as "4T") cr tetO sequences 
having a nucleotide substitution at either the 4 or 6 position ("4C", M 6C", "4A" or M 4G M ). The 
P-galactosidase activity in the cells was measured in the absence or presence of the TetR 
construct. The ability of the TetR protein to bind to the tetO sequence was determined by 

30 examining the % repression of P-galactosidase activity in the cell. 

In a first series of experiments, a proline to glutamine mutation was introduced at 
position 39 ("39PQ"), alone or together with additional mutations at positions 37, 41 and/or 
42. The results of these analyses are summarized below in Tables 2 and 3. In these tables, p- 
galactosidase activity in the absence of TetR is standardized as 100 %. In the presence of a 

35 TetR that can bind the target tetO sequence linked to the reporter gene, P-galactosidase 
activity is reduced (e.g., in the presence of a wild-type TetR that can bind its target tetO 
sequence, p-galactosidase activity typically is less than 1 %). In contrast, in the presence of a 
DBD-mutated TetR that cannot bind the target tetO sequence linked to the reporter gene, p- 
galactosidase activity remains near 100 %. 




Table 2. 





tet Operator 


Tet Repressor 


4C 


4T (wt) 


6C 


4A 


4G 


wt 


57.3 ± 1.6 


0.1 ±0.1 


47.1 ±2.7 


28 ± 0.7 


88.3 ±2.1 


39PQ 


46.7 ± 1.2 


96.4 ±0.1 


88.4 ±4.5 


97.3 ± 1.2 


99.2 ± 4.2 


37ES 


9.3 ± 0.3 


0.3 ±0.1 


35 ±2.0 


6±0.1 


64.2 ±2.2 


37ES39PQ 


6.2 ± 0.5 


92.5 ± 2 


100.2 ± 2.3 


95.8 ± 1.1 


99.1 ±3.9 


37ES39PQ42YM 


8.4 ± 0.3 


96.7 ± 0.8 


95.6 ±1.1 


98.8 ± 3 


99.6 ± 3.4 


37ES39PQ41LV 


14.7 ± 1 


95.1 ±0.8 


97.6 ± 2.8 


102 ±5.3 


96.2 ±3.1 


37ES39P041LV42YM 


0,2 + 0,1 


87,9 + OJ 


?U+2,2 


74.5 ± 3.4 


88.2 ± 1.9 


37ES39PQ41U42YM 


0.7 + 0.2 


87.7 ±2.9 


98,4 + 1.2 


79.4 ±3.4 


87.2 ± 0.6 


37ES39PQ41LV42YS 


99.9 ± 3.3 


98.4 ±2.1 


98.8 ± 1.9 


99.8 ± 1.6 


100.1 ±2.6 


37ET 


13.7±0.6 


0.2 ±0.1 


40.9 ± 2.9 


8.1 ±0.6 


87.4 ± 2.5 


37ET39PQ 


4.8 ± 0.2 


93.5 ± 3.4 


99.4 ± 2.2 


102.3 ±2.7 


100.4 ± 1.7 


37JET39PQ42YM 


1,4 + 0 


93,8 + 6,2 


100,9 + 3.9 


103 ± 1.3 


92.2 ± 0.6 


37ET39PQ41LV 


4.3 ± 0.2 


92 ± 0.9 


100 ±0.2 


100.2 ±4.2 


97.9 ± 1.1 


37ET39P041LV42YM 


0.1 ± 0.1 


95,8 + 0,1 


104,2 + 2,7 


93.1 ± 1.4 


91 ±4.1 


37EL 


82 ± 0.4 


0.4 ±0.1 


72.5 ±4.8 


84.8 ±2.5 


101.6 ± 1.9 


37EL 39PQ 


58.8 ±0.5 


95.6 ±2.2 


98.3 ±4 


98.9 ±2.7 


102.4 ±4.1 


37EL39PQ42YM 


8.2 ± 0.5 


96.7 ± 0.3 


99 ± 4.5 


95.4 ±2.5 


97 ±3 


37EL39PQ4ILV 


73.9 ±0.2 


95.3 ± 0.4 


98.2+2.5 


94.5 ± 0.9 


98.1 ±4.9 


37EL39PQ41LV42YM 


20.5 ±0.1 


92.3 ± 1.2 


95.6 ± 2.9 


96.1 ±0.6 


102.2 ±4.8 


37EL39PQ41LM42YM 


0,7 + 0,1 


96.3 + 2,2 


98.2 + 2,5 


96.9 ± 1.2 


102.6 ±0.5 


39PQ41LM 


90.1 ±2.1 


97.2 ± 3.4 


99.4 ±3.4 


96.5 ±2.8 


96.9 ± 2.2 


39PQ42YM 


71.1 ± 1.8 


95.6 ±3.3 


100.6 ± 1.6 


97 ± 1.7 


97.1 ±8.2 


39PQ41LM42YM 


72.3 ± 0.7 


98.6 ± 5.7 


99.3 ± 0.4 


93.5 ±4.8 


99.1 +4.5 


41LM 


94 + 0.8 


1.1 ±0 


95.4 ± 5 


97.7 ± 1.1 


103.1 ±2 


42YM 


83.4 ±0.2 


20.2 ± 1 


99.8 ± 3 


100.3 ±2.6 


100.4 ±2.5 


41LM42YM 


88.5 ± 0.4 


81.2±4 


103.4 ± 1.7 


102.2 ±4.7 


99.2 ± 9.6 


37EM 


16.8 ± 1.4 


1 ±0.1 


19.8 ±0.5 






37EM39PQ42YM 


7.5 ± 0.9 


103.3 ± 7 


99.3 ± 1.3 






37EM39PQ41LV42YM 


2.4 ± 0.3 


102.9 ± 2.4 


103.7 ± 3 






37EV 


7.9 ± 1.1 


0.6 ±0 


40.3 ±4.2 






37EV39PO 


0.5 ±0,1 


93.2 + 1.4 


96.7 ±0.1 






37EV39PQ42YM 


0.2 ±0.1 


97.8 ± 0.8 


95.5 + 3-5 






37EV39P041LV42YM 


0,1+0 


92.5 ±1.8 


97 f 4 + 0,2 






37EV39PQ41LT42YM 


4.6 ± 0.9 


93 ± 2.2 


95.4 ± 0.2 






37EF 


81.3 ±0.1 


1.6 ±0.2 


6.4 ± 1.2 






37EF39PQ 


24.1 ±3.8 


96.6 ±3.1 


100 ±2.2 






37EF39PQ42YM 


10.4 ± 1 


99.5 ± 4.7 


101.6 ± 1.3 






37EF39PQ41LV42YM 


11.8 ± 1.2 


100 ±8 


100 ±3.7 






37EW 


86.7 ± 1.4 


2.8 ±0.3 


25.9 ± 2.3 






37EW39PQ 


51.8 ± 1.4 


94.5 ±2.6 


100.2 ±0.1 






37EW39PQ42YM 


40.7 ± 0 


103.2 ±3.4 


103.1 ±3.9 






37EW39PQ41LV42YM 


40.8 ±3 


99.6 ± 8.6 


101.2±2 






37EI 


94.1 ± 1.8 


3.1 ±0.9 


100.1 ±2.1 






37EI39PQ 


43.8 ± 1.6 


94.2 ± 1.2 


96.4 x 0.2 






37EI39PQ42YM 


2.4 ± 0.2 


93.6 ± 1.8 


96.6 ± 2.4 






37EI39PQ41LV42YM 


16.4 ± 1.2 


99.2 ± 1.6 


98.1 ±0.3 






37EH 


61 ± 1.8 


0.7 ±0.1 


2.2 ± 0.5 






37EH39PQ 


3.1 ±0.2 


76.5 ± 2.7 


99.2 ± 0.9 






37EH39PQ42YM 


15.7 ±0.7 


94.7 ± 0.8 


98.6 ± 1.7 






37EH39PQ4 1 LV42YM 


15.5 ± 1.8 


99.7 ± 1.9 


101.8 ±4.4 








37ER 


7 ±0.6 


0.8 ±0.1 


22.6 ±2.3 






37EB39PO 


0.3 + 0.1 


95.6 A 0.8 


95.S±3.3 






37ER39P041LV42YM 


O.K ± 0.X 


95.3 ±1,9 


95.6 ±1,9 







Table 3. 





tet Operator 


Tet Repressor 


4C 


6C 


37ES39PQ41LI42YF 


17.2 ± 1.1 


83.1 ±0.7 


37ES39PQ41LA42YF 


31.3 ± 1 




37ES39PQ41LP42YW 


29.8 ± 0.5 




37EA 


8.5 ± 0.5 


20.4 ± 0.3 


37EA39PQ 


1.5 + 0,1 


94.5 ±5,1 


37EA39P042YM 


1.6 + 0.3 


92.6 ±1.4 


37EA39P041LV42YM 


1.4 + 0.2 


98.3 ±1.4 


37EA39PQ41LM42YI 


47.7 ± 9.2 




37EA39PQ41LA42YL 


17.4 ±0.5 




37EA39PQ41LA42YW 


24.9 ± 0.4 




37EA39PQ41LW42YF 


38.5 ±0.5 




37EV39PQ41LV42YC 


8.3 ± 0.2 




37EV39PQ41LI42YL 


26.9 ± 0.5 




37EV39PQ41LM42YL 


26.3 ± 0.3 




37ER39PQ41LV42YL 


22.7 ± 0.3 




39ER39PQ41LM42YL 


20.9 ± 0.8 




37ER39PQ41LA42YW 


35.9 + 1.7 




37ER39PQ41LI42YL 


21.8 ± 1.6 




37EK 


7.2 ± 0.4 


20.8 ± 0.3 


37EK39PQ 


2.2 ± 0.5 


101.3 ± 0.4 


37EK39PQ42YF 


10.8 ±0.4 




37EK39PQ41LT42YM 


2 ±0.1 


103.4 ±3 


37EK39PQ41LW 


10.2 ±0.3 




37EI39PQ41LF42YM 


60.3 ± 0.5 




37EQ 


14.5 ±0.1 


13.2 ±0.8 


37EQ39PQ 


2.7 ± 0.3 


102.7 ± 1.6 


37EQ39PQ42YM 


6.1 ± 0.2 


104.3 ± 2.3 


37EQ39PQ41LT42YM 


21.2 ±0.6 




37EG 


89.8 ±0.5 




37EG39PQ 


78.8 ±0.6 




37EG39PQ42YM 


70.8 ± 1.5 




37EG39PQ4 1 LV42YM 


64.9 ± 1.1 




37EP 


97.5 ± 7.2 




37EP39PQ 


87.1 ±4.8 




37EP39PQ42YM 


64.6 ± 1.2 




37ED 


96.8 ±6.1 




37ED39PQ 


106.2 ± 1.9 




37ED39PQ4 1 LV42YM 


88.2 ± 1.8 




37EC 


15.3 ±0.3 




37EC39PQ41LA42YM 


1.1 ±0.1 




37EY 


88.8 ± 1.3 




3 7E Y3 9PQ4 1 L A42 YM 


16.8 ± 1.4 




37EN 


102.2 ±8.5 




37EN39PQ 


47.9 ±0.5 




39PQ42YM 


71.1 ±1.8 




39PQ41LV42YM 


49.4 ± 1.2 






The results shown in Tables 2 and 3 demonstrate that the DBD-mutant TetR having 
only the 39PQ mutation cannot bind the wild-type (4T) tetO sequence (fi-galactosidase 
activity remains at approximately 96 %, compared to 100 % activity in the absence of the 
TetR). Furthermore, a series of mutants having, in addition to the 39PQ mutation, further 
5 mutations at positions 37, 41 and/or 42 were identified that had the following characteristics 
1) an ability to efficiently bind to the "4C" tetO variant and 2) an inability to bind to the "6C" 
tetO variant. Such mutants that could inhibit expression of a 4C te/O-linked reporter gene by 
at least 90% (i.e., p-galactosidase activity is less than or equal to 10 %) are indicated in bold 
type in Tables 2 and 3 above. These mutants include: 37ES39PQ, 37ES39PQ42YM, 

10 37ES39PQ41LV42YM, 37ES39PQ41LI42YM, 37ET39PQ, 37ET39PQ42YM, 

37ET39PQ41LV, 37ET39PQ41LV42YM, 37EL39PQ42YM, 37EL39PQ41LM42YM, 
37EM39PQ42YM, 37EM39PQ41LV42YM, 37EV39PQ, 37EV39PQ42YM, 
37EV39PQ41LV42YM, 37EV39PQ41LT42YM, 37EI39PQ42YM, 37EH39PQ, 37ER39PQ, 
37ER39PQ41LV42YM, 37EA39PQ, 37EA39PQ42YM, 37EA39PQ41LV42YM, 

15 37EK39PQ, 37EK39PQ41LT42YM, 37EQ39PQ and37EQ39PQ42YM. Even more preferred 
mutants that could inhibit expression of a 4C te/O-linked reporter gene by at least 98% (i.e., 
P-galactosidase activity is less than or equal to 2 %) are indicated in bold type and underlined 
in Tables 2 and 3 above. These mutants include: 37ES39PQ41LV42YM, 
37ES39PQ41LI42YM, 37ET39PQ42YM, 37ET39PQ41LV42YM, 37EL39PQ41LM42YM, 

20 37EV39PQ, 37EV39PQ42YM, 37EV39PQ41LV42YM, 37ER39PQ, 

37ER39PQ41LV42YM, 37EA39PQ, 37EA39PQ42YM, 37EA39PQ41LV42YM and 
37EK39PQ41LT42YM. 

In a second series of experiments, a preferred three-position mutation identified 
25 above, 37EA39PQ42YM, was chosen for further analysis. The binding specificities of one-, 
and two-position mutants (i.e., 37EA, 42YM, 37EA39PQ and 37EA42YM) were compared 
to the three-position mutant (37EA39PQ42YM) using an expanded series of tetO variants 
having nucleotide substitutions at either the 3, 4, 5 or 6 position. The results of these studies 
are summarized below in Tables 4 and 5. 
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Table 4. 



tet02 


tef/? Allele 


EA37 




EA37PQ39 


EA37PQ39YM42 






WT 




Wl 


0 


± 


0 


< 


67 


± 


2 


102 


± 


0 




0 


± 


0 


3C 


6.1 


± 


1.2 


< 


88 


± 


4 


< 104 


± 


3 


> 


45 


± 


1 




54 


± 


1 


< 


99 


± 


3 


114 


± 


11 




98 


± 


2 


3T 


31 


± 


2 


< 


99 


± 


1 


108 


± 


7 


> 


76 


± 


3 


4A 


0.3 


± 


0.1 


< 


74 


± 


1 


< 92 


± 


3 


> 


6.0 


db 


0.5 


4C 


0.5 


± 


0.1 




0 


± 


0 


0 


± 


0 


< 


31 


± 


2 


4G 


16 


± 


1 


< 


83 


± 


0 


< 96 


± 


1 


> 


69 


± 


2 


5C 


2.6 


± 


0.6 


< 


16 


± 


0 


< 55 


± 


0 


> 


1.6 


± 


0.1 


5G 


46 


± 


2 


< 


104 


± 


2 


100 


± 


4 




94 


± 


2 


5T 


0.4 


± 


0.1 


< 


103 


± 


3 


109 


± 


3 


> 


5.3 


± 


1.3 


6A 


0 


± 


0 


< 


100 


± 


5 


105 


± 


0 


> 


0.1 


± 


0 


6C 


6 


± 


0 


< 


102 


± 


2 


101 


± 


1 


> 


18 


± 


1 














P-Galactosidase Activity in 


[%] 













Table 5. 



tet02 


tetR Allele 


EA37 


YM42 


EA37YM42 






WT 




wt 


0 


± 


0 


4.4 


± 


1.4 


0 


± 


0 




0 


± 


0 


3C 


6.1 


± 


1.2 


98 


± 


2 


98 


± 


5 


> 


45 


± 


1 


3G 


54 


± 


1 


101 


± 


2 


106 


± 


3 




98 


± 


2 


3T 


31 


± 


2 


102 


± 


2 


109 


± 


2 


> 


76 


± 


3 


4A 


0.3 


± 


0.1 


68 


± 


5 


4.1 


± 


0.1 


< 


6.0 


± 


0.5 


4C 


0.5 


± 


0.1 


60 


± 


4 


> 0.2 


± 


0.1 


< 


31 


± 


2 


4G 


16 


± 


1 


91 


± 


1 


32 


± 


1 


< 


69 


± 


2 


5C 


2.6 


± 


0.6 


48 


± 


3 


> 1.2 


± 


0 




1.6 


± 


0.1 


5G 


46 


± 


2 


101 


± 


2 


105 


± 


2 




94 


± 


2 


5T 


0.4 


± 


0.1 


80 


± 


3 


0.3 


± 


0 


< 


5.3 


± 


1.3 


6A 


0 


± 


0 


60 


± 


0 


21 


± 


2 


> 


0.1 


± 


0 


6C 


6 


± 


0 


93 


± 


1 


48 


± 


0 


> 


18 


± 


1 












P-Galactosidase Activity in 


[%] 













The results of this analysis indicate that some of the one- and two-positions mutants, 
while retaining the ability to bind the 4C variant (like the three-position variant), have a 
broader binding specificity than the three-position variant. For example, 37EA42YM 
efficiently inhibits P-galactosidase activity of the 4C variant (0.2 % activity), but also inhibits 
10 the activity of the 4 A variant (4.1 % activity), the 5C variant (1.2 % activity) and the 5T 
variant (0.3 % activity). In contrast, 37EA39PQ42YM efficiently inhibits p-galactosidase 




activity of the 4C variant (0 % activity), but does not efficiently inhibit the activity of the 4A 
variant (92 % activity), the 5C variant (55 % activity) or the 5T variant (109 % activity). 

In a third scries of experiments, a panel of TetR mutants having a substitution at 
5 positions 43 and/or 44, alone or together with additional mutations at positions 37, 40 and/or 
41, were made and tested as described above. The results of these analyses are summarized 
below in Table 6. 

Table 6. 



Operator Variant 




6C 






Wt 






4C 






6A 






5G 




pWH1201 (Control) 


100% 




6 


100% 


± 


1 


100% 


+ 


6 


100% 


+ 


7 


100% 


± 


3 


pWH520 (WT TetR) 


24% 


i 


1 


0.2% 


± 


0.1 


38% 


+ 


3 


0% 


+ 


0.2 


5% 


± 


1 


43WR 


68% 


+ 


2 


103% 


+ 


3 


95% 


+ 


1 


94% 


+ 


3 


90% 


+ 


2 


44 HA 


101% 


± 


2 


91% 


+ 


6 


104% 


+ 


5 


96% 




2 


105% 


± 


3 


43WR44HA 


28% 


+ 


0.5 


105% 


± 


7 


102% 


± 


4 


91% 


± 


2 


90% 


± 


7 


41LI43WR44HA 


35% 


+ 


1.5 


101% 


± 


2 


93% 


+ 


3 


97% 


± 


4 


93% 


± 


1 


44 HV 


97% 


+ 


3 


87% 


+ 


2 


107% 


+ 


5 


97% 


± 


2 


106% 


± 


7 


43WR44HV 


36% 


+ 


1.8 


101% 


+ 


3 


95% 




3 


91% 


+ 


4 


97% 


± 


3 


41LI43WR44HV 


23% 


± 


1 


96% 


± 


4 


94% 


± 


4 


83% 


± 


6 


92% 


± 


4 


40TR4 1 LI43 WR44HV 


27% 


± 


1 


105% 


± 


4 


94% 


± 


3 


92% 


± 


5 


91% 


± 


4 


41LI43WR44HT 


25% 


± 


1 


100% 


± 


3 


97% 


± 


2 


96% 


± 


2 


94% 


± 


3 


37ES 


10% 


± 


0.3 


0% 


± 


0.1 


1.7% 


± 


0.2 


0% 


± 


0 


0% 


± 


0.1 


37ES43WR44HN 


3.8% 


± 


03 


80% 


± 


3 


97% 


± 


5 


104% 


± 


3 


92% 


± 


3 


37ES43WR44HV 


5.5% 


± 


0.2 


56% 


± 


2 


94% 


± 


3 


80% 


± 


3 


92% 


± 


0.5 


41LV43WR44HV 


87% 


± 


2 


102% 


± 


2 


93% 


± 


3 


108% 


± 


3 


95% 


+ 


3 


37ES4ILV43WR44HV 


24% 


± 


0.7 


77% 


± 


4 


92% 


± 


3 


103% 


± 


3 


97% 


± 


1 




The results shown in Table 6 demonstrate that DBD-mutant TetRs having an ability to 
efficiently bind to the "6C" tetO variant but an inability to bind efficiently to the "4C" tetO 
variant can be identified. Mutants that could inhibit expression of a 6C te/O-linked reporter 
5 gene by at least 94% (i.e., p-galactosidase activity is less than or equal to 6 %) are indicated 
in bold type in Table 6 above. The two preferred mutants are 37ES43WR44HN (only 3.8 % 
activity with the 6C variant but 97 % activity with the 4C variant) and 37ES43WR44HV 
(only 5.5 % activity with the 6C variant but 94 % activity with the 4C variant). The most 
preferred mutant is 37ES43WR44HN. 

10 

B. Use of tTA and tTA R transactivators with different tetO binding specificities to regulate 
gene expression in cells 

1 5 Tet repressor mutants having different DNA binding specificities, as described above 

in Part A, were used to construct expression vectors encoding modified transactivator fusion 
proteins. One modified construct encoded a TetR having the DBD mutations 
37EA39PQ42YM in an otherwise wild-type sequence, fused to VP 16. This construct is 
referred to as tTA 4 . This construct binds to the 4C tetO variant but not the 6C tetO variant 

20 and is regulated by Tc in the same manner as a wild-type TetR. Thus, this tTA 4 

transactivator binds to 4C tetO sequences (but not 6C tetO sequences) in the absence, but not 
the presence, of Tc. Another modified construct encoded a TetR having the DBD mutations 
ES37WR43HN44, which confer the ability to bind the 6C tetO variant but not the 4C tetO 
variant, and, additionally, the mutations 71EK95DN101LS102GD, which confer the 

25 "reverse" Tc-regulated phenotype, fused to VP 16. This construct is referred to as tTA R 6 . 
Thus, this tTA R 6 construct binds to 6C tetO sequences (but not 4C tetO sequences) in the 
presence, but not the absence, of Tc. 

Furthermore, the te/O-linked luciferase reporter gene construct PhCMV*-l was 
modified to contain either the 4C or the 6C mutation. These modified reporter gene 

30 constructs are referred to herein as tefOc4 and tetOQ 6 , respectively. 

To examine the ability of the tTA4 and tTA R $ expression constructs to regulate 
expression of the tefOc4 ^d te'c6 reporter gene constructs within cells, various combinations 
of transactivator expression constructs and reporter gene constructs were cotransfected into , 
HeLa cells in transient transfection assays. The luciferase activity in cell extracts prepared 

35 from transfected cells cultured either in the presence or absence of doxycycline was then 
examined. Representative result:: are illustrated graphically in Figure 13. 

The first pair of bars in Figure 13 demonstrate that tTA 4 efficiently regulates 
expression of tetOQ4 (i.e., low luciferase activity in the presence of doxycycline but -1000- 
fold increase in luciferase activity in the absence of doxycycline). The second pair of bars in 




Figure 13 demonstrate that tTA R g efficiently regulates expression of tetOQ 6 (i.e., low 
luciferase activity in the absence of doxycycline but -100-fold increase in luciferase activity 
in the presence of doxycycline). Finally, the third and fourth pairs of bars in Figure 13 
demonstrate that when both the tTA4 and tTA R $ constructs are introduced into a single host 
5 cell, their binding specificity for their respective operators, tetOc4 and tetOQ& is maintained 
(i.e., they do not exhibit crossactivation of a non-target tetO sequence). Thus, when tTA 4 and 
tTA R 6 are both introduced into cells together with the tefOc4 reporter gene, expression of the 
reporter gene is regulated in a manner identical to when tTA 4 alone is introduced into the cell 
(i.e., high luciferase activity only in the absence of doxycycline). Similarly, when tTA 4 and 

10 tTA R 6 are both introduced into cells together with the tetO C6 reporter gene, expression of the 
reporter gene is regulated in a manner identical to when tTA R 6 alone is introduced into the 
cell (i.e., high luciferase activity only in the presence of doxycycline). Additionally, when 
tTA 4 alone is cotransfected with tetOQ^ or tTA R $ alone is cotransfected with te/Oc 4 no effect 
of the transactivators is observed. These various results demonstrate the strict DNA binding 

1 5 specificty of the tTA 4 and tTA R 6 fusion proteins and, moreover, demonstrate that the DNA 
binding specificity of an unfused DBD-mutated Tet repressor is maintained when the DBD- 
mutated Tet repressor is incorporated into a transcriptional activator fusion protein. 

C. Dimerization Specificity of Different Classes of Tet Repressors 

20 

Tet repressors can be placed into seven different classes called A to E ? G and H. Each 
repressor protein typically recognizes its operator sequences as a dimer. To establish an 
efficient combination regulatory system, in which two (or more) Tc-regulated fusion proteins 
are introduced into the same cell, it may be preferable to use classes of TetR that do not 

25 heterodimerize with each other (e.g., a transactivator fusion protein of one class and a 
transcriptional inhibitor fusion protein of another class). To determine the dimerization 
specificity of TetRs of different classes, in vitro dimerization assays were performed. In this 
assay, two purified TetR which are distinguishable by their electrophoretic mobilities are 
mixed in a solution to avoid oxidation of the proteins and incubated for ten minutes at 50 °C, 

30 followed by a 30 minute incubation at °4 C. The protein mixture is then analyzed by native 
polyacrylamide gel electrophoresis. 

A deletion mutant of the TnlO-encoded class B TetR (A26-53) was used as one 
protein in the mixture. This deletion mutant has an increased electrophoretic mobility 
compared to the full-length class B, C, D or E repressors tested. Thus, heterodimers between 

35 the deletion mutant and a full-length TetR can be detected by PAGE. When the class B 

deletion mutant was incubated with either a full-length class B TetR or a full-length class C 
tetR, heterodimers between the deletion mutant and the full-length repressor were observed. 
In contrast, when the class B deletion mutant was incubated with either a full-length class D 




TetR or a full-length class E TetR, no heterodimer between the deletion mutant and the full- 
length repressor were observed. 

To confirm these in vitro findings, in vivo transdominance experiments were 
performed. In these assays, a TetR expression construct was introduced into an E. coli strain 
5 that carries a tefO-linked P-galactosidase reporter gene, either alone or together with an 
expression construct for the class B TetR deletion mutant described above (A26-53), and p- 
galactosidase activity was measured. In this assay, the level of p-galactosidase activity in the 
absence of any TetR constructs was standardized as 100 %. The presence of a full-length 
TetR construct alone results in decreased p-galactosidase activity. When a full-length TetR 

10 and the deletion construct are coexpressed in the cells, the deletion mutant cannot bind to 

tetO sequences itself but, if it is able to heterodimerize with the full-length TetR, the deletion 
mutant will reduce the ability of the full-length TetR to repress p-gal activity. Thus, if the 
deletion mutant and the full-length TetR heterodimerize, P-gal activity is higher in the 
presence of the deletion mutant than in the absence of the deletion mutant. The results of 

1 5 these experiments are summarized below in Table 7. 



Table 7 



Tet 
Repressor 


Level of 
TetR Expression 


% P-galactosidase activity 


Factor of 
Transdominance 


-A26-53 


+A26-53 


none 




100 


100 




wt class B 


low 


1.7 (±0.6) 


37.3 (± 3.5) 


22 


wt class B 


high 


1.2 (±0.2) 


2.4 (±0.1) 


2 


wt class C 


low 


84.5(± 1.3) 


100.6 (±5.1) 


1.2 


wt class C 


high 


6.4 (± 0.0) 


12.8 (± 1.2) 


2 


wt class D 


low 


62.2 (± 3.5) 


65.4 (± 2.0) 


1.1 


wt class D 


high 


1.4 (±0.0) 


1.5 (±0.0) 


1.1 


wt class E 


low 


40.5 (± 1.2) 


41.1 (±3.8) 


1.0 


wt class E 


high 


1.2 (±0.1) 


1.2 (±0.1) 


1.0 



The results shown in Table 7 confirm that the full-length class B TetR and the class B 
20 deletion mutant can heterodimerize (e.g., at low levels of TetR expression, P-galactosidase 
activity increases 22-fold in the presence of the deletion mutant). Similarly, these 
experiments indicate that the class B deletion mutant can heterodimerize with the full-length 
class C TetR (e.g., at high levels of TetR expression, P-galactosidase activity increases 2-fold 
in the presence of the deletion mutant). In contrast, no evidence of heterodimerization 
25 between the class B deletion mutant and either the full-length class D or class E TetR was 
observed. 

To confirm that results shown in Table 7 were not simply a specific effect due to the 
particular deletion mutant used, similar experiments were performed with two additional 




deletions mutants (TetR(B)A9-l 1 and TetR(B)A83-87). Similar results were observed with 
the two additional deletion mutants, confirming that the lack of heterodimerization between 
class B TetR and either class D TetR or class E TetR was a general phenomenon that did not 
depend on which particular deletion mutant was used. 
5 Thus, based on in vitro and in vivo assays, a class B TetR does not efficiently 

heterodimerize with either a class D TetR or a class E TetR. 

D. Model Reg ulatory Systems for Combination Regulation of Genes 

10 The results discussed in the preceding sections can be combined to create preferred 

model schemes for various combinatorial regulatory systems. In the first combinatorial 
regulatory system, a single gene is coordinately regulated by a reverse transactivator fusion 
protein and a transcriptional inhibitor fusion protein. This scheme is illustrated in Figure 14, 
Model A. In this system, at least one tetOAC variant sequence is operatively linked to a 

1 5 minimal promoter (containg a "TATA" motif) and a gene of interest ("X"). Regulation of 
gene X in a cell is mediated, in part, by a "reverse activator" composed of a class B TetR 
having the DBD mutations 37EA39PQ42YM, which confer the ability to bind the 4C variant, 
and, additionally, the mutations 71EK95DN101LS102GD, which confer the "reverse" Tc- 
regulated phenotype, fused to VP 16. Thus, this reverse activator binds to tetO-4C in the 

20 presence of Tc to thereby stimulate expression of the gene of interest. Regulation of the gene 
is also mediated, in part, by a "silencer" composed of a class D TetR having the DBD 
mutations 37EAYM42, which confer the ability to bind the 4C variant but also the ability to 
bind a broader spectrum of operator sequences than the triple DBD mutation of the reverse 
transactivator, fused to a silencer domain, e.g., from v-erbA or Drosophila Krueppel. Thus, 

25 this silencer binds to tetO-AC in the absence of Tc to thereby inhibit expression of gene X. 
Based on the results in Part C above, the class B TetR-containing reverse transactivator and 
the class D TetR-containing silencer will not heterodimerize. The combined use of the 
reverse transactivator and the silencer to regulate the gene of interest has the advantage that 
expression of the gene of interest is highly repressed by the silencer in the absence of Tc but 

30 is efficiently stimulated by the reverse transactivator in the presence of Tc. 

In the second combinatorial regulatory system, two genes in the same cell are 
independently regulated by a transactivator fusion protein and a reverse transactivator fusion 
protein. This scheme is illustrated in Figure 14, Model B. In this system, at least one tetO- 
4C variant sequence is operatively linked to a minimal promoter ("TATA") and a first gene of 

35 interest ("X"). Additionally, at least one te/0-6C variant sequence is operatively linked to a 
minimal promoter ("TATA") and a second gene of interest (" Y"). Regulation of gene X in a 
cell is accomplished using an "activator" composed of a class B TetR having the DBD 
mutations 37EA39PQ42YM, which confer the ability to bind the 4C variant, in an otherwise 
wild-type sequence, fused to VP 16. Thus, this activator binds to tetO-AC in the absence (but 




not the presence) of Tc to thereby stimulate expression of gene X. Regulation of gene Y in 
the cell is accomplished using a "reverse activator" composed of a class D TetR having the 
DBD mutations 37ESWR43HN44, which confer the ability to bind the 6C variant, and, 
additionally, the mutations 71EK95DN101LS102GD, which confer the "reverse" Tc- 
5 regulated phenotype, fused to VP 16. Thus, this reverse activator binds to tetO-6C in the 
presence of Tc to thereby stimulate expression of gene Y. Based on the results in Part C 
above, the class B TetR-containing transactivator and the class D TetR-containing reverse 
transactivator will not heterodimerize. This combinatorial regulatory system thus allows for 
stimulation of one gene of interest in a cell in the absence of Tc and stimulation of another 
10 gene of interest in the same cell in the presence of Tc. 



EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
1 5 routine experimentation, many equivalents to the specific embodiments of the invention 

described herein. Such equivalents are intended to be encompassed by the following claims. 
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15 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 
25 (A) NAME/KEY: exon 

(B) LOCATION: 1..1008 

( ix) FEATURE : 

(A) NAME /KEY : mRNA 
30 (B) LOCATION: 1..1008 

(ix) FEATURE: 

(A) NAME / KEY : misc. binding 

(B) LOCATION: 1. .207 

35 

(ix) FEATURE: 

(A) NAME/KEY: misc. binding 

(B) LOCATION: 208.. 335 

40 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1005 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

45 ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 4 8 
Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 96 
50 Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 

20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA CTG TAT TGG CAT GTA AAA AAT AAG 144 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
55 35 40 45 



CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

5 ACT CAC TTT TGC CCT TTA AAA GGG GAA AGC TGG'CAA GAT TTT TTA CGC 24 0 

Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT* CAT CGC AAT GGA 2 88 
10 Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 

85 90 95 

GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
15 100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 384 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 

115 120 125 

20 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 432 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 

130 135 140 

25 GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 4 80 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 52 8 

30 Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 

165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 5 76 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
35 180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC GCG 624 
lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

40 

TAC AGC CGC GCG CGT ACG AAA AAC AAT TAC GGG TCT ACC ATC GAG GGC 672 
Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr lie Glu Gly 
210 215 220 

45 CTG CTC GAT CTC CCG GAC GAC GAC GCC CCC GAA GAG GCG GGG CTG GCG 72 0 

Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 
225 230 235 240 

GCT CCG CGC CTG TCC TTT CTC CCC GCG GGA CAC ACG CGC AGA CTG TCG 76 8 

50 Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 

245 250 255 

ACG GCC CCC CCG ACC GAT GTC AGC CTG GGG GAC GAG CTC CAC TTA GAC 816 
Thr Ala Pro Pro Thr As^ Val Ser Leu Gly Asp Glu Leu His Leu Asp 
55 260 265 270 

GGC GAG GAC GTG GCG ATG GCG CAT GCC GAC GCG CTA GAC GAT TTC GAT 864 
Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 280 285 
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CTG GAC ATG TTG GGG GAC GGG GAT TCC CCG GGT CCG GGA TTT ACC CCC 912 

Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 -295 300 

5 

CAC GAC TCC GCC CCC TAC GGC GCT CTG GAT ATG GCC GAC TTC GAG TTT 960 

His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 
305 310 315 320 

10 GAG CAG ATG TTT ACC GAT CCC CTT GGA ATT GAC GAG TAC GGT GGG TAG 1008 
Glu Gin Met Phe Thr Asp Pro Leu Gly lie Asp Glu Tyr Gly Gly 
325 330 335 

(2) INFORMATION FOR SEQ ID NO : 2 : 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
25 20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

30 Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 



35 



50 



Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 

65 70 75 80 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 

85 90 95 



Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
40 100 105 110 

Leu Glu Asn Gin Leu Ala Phe leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

45 Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 



Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 

145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 

165 170 175 



Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
55 180 185 190 
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Ile lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 

Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly* Ser Thr lie Glu Gly 
210 215 220 

Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 
225 230 235 240 

Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 
245 250 255 

Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp 
260 265 270 

Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 280 285 

Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 295 300 

His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 
305 310 315 320 

Glu Gin Met Phe Thr Asp Pro Leu Gly lie Asp Glu Tyr Gly Gly 
325 330 335 



INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



GAC GCG CTA GAC GAT TTC GAT CTG GAC ATG TTG 3 3 

Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(v) FRAGMENT TYPE: internal 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu 
15 10 

10 (2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

25 Met Pro Lys Arg Pro Arg Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



35 



55 



(ii) MOLECULE TYPE: DNA 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GAATTCGGGG CCGCGGAGGC TGGATCGGTC CCGGTGTCTT CTATGGAGGT CAAAACAGCG 6 0 

TGGATGGCGT CTCCAGGCGA TCTGACGGTT CACTAAACGA GCTCTGCTTA TATAGGTCGA 12 0 

45 

GTTTACCACT CCCTATCAGT GATAGAGAAA AGTGAAAGTC GAGTTTACCA CTCCCTATCA 180 

GTGATAGAGA AAAGTGAAAG TCGAGTTTAC CACTCCCTAT CAGTGATAGA GAAAAGTGAA 24 0 

50 AGTCGAGTTT ACCACTCCCT AC CAGTGATA GAGAAAAGTG AAAGTCGAGT TTACCACTCC 3 00 

CTATCAGTGA TAGAGAAAAG TGAAAGTCGA GTTTACCACT CCCTATCAGT GATAGAGAAA 36 0 

AGTGAAAGTC GAGTTTACCA CTCCCTATCA GTGATAGAGA AAAGTGAAAG TCGAGCTCGG 42 0 

TACCCGGGTC GAGTAGGCGT GTACGGTGGG AGGCCTATAT AAGCAGAGCT CGTTTAGTGA ~ 48 0 

ACCGTCAGAT CGCCTGGAGA CGCCATCCAC GCTGTTTTGA CCTCCATAGA AGACACCGGG 54 0 
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ACCGATCCAG CCTCCGCGGC CCCGAATTC 



569 



10 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



15 



20 



25 



30 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AGATC TGC AG GGTCGCTCGG TGTTCGAGGC CACACGCGTC 
CCGGATCTCG AGTTTAC C AC TCCCTATCAG TGATAGAGAA 
ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA 
AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT 
TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG 
TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC 
GTCGAGCTCG GTACCCGGGT CGAGTAGGCG TGTACGGTGG 
TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA 
AAGACACCGG GACCGATCCA GCCTCCGCGG CCCCGAATTC 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Human cytomegalovirus 

(B) STRAIN: K12, Towne 



ACCTTAATAT 
AAGTGAAAGT 
CCACTCCCTA 
AGAGAAAAGT 
AGTTTACCAC 
AGTGATAGAG 
GAGGCCTATA 
CGCTGTTTTG 



GCGAAGTGGA 
CGAGTTTACC 
TCAGTGATAG 
GAAAGTCGAG 
TCCCTATCAG 
AAAAGTGAAA 
TAAGCAGAGC 
ACCTCCATAG 



60 
120 
180 
240 
300 
360 
420 
480 
520 



(ix) FEATURE: 

(A) NAME /KEY: mRNA 

(B) LOCATION: 382. .450 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GAATTCCTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 60 
55 ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 12 0 



AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 180 

TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC TCCCTATCAG 240 

TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC " AGTGATAGAG AAAAGTGAAA 300 

GTCGAGCTCG GTACCCGGGT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 360 

5 TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 420 

AAGACACCGG GACCGATCCA GCCTCCGCGG 450 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: Towne 

20 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 382.. 450 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GAATTCCTCG ACCCGGGTAC CGAGCTCGAC TTTCACTTTT CTCTATCACT GATAGGGAGT 6 0 

GGTAAACTCG ACTTTCACTT TTCTCTATCA CTGATAGGGA GTGGTAAACT CGACTTTCAC 12 0 

TTTTCTCTAT CACTGATAGG GAGTGGTAAA CTCGACTTTC ACTTTTCTCT ATCACTGATA 18 0 

GGGAGTGGTA AACTCGACTT TCACTTTTCT CTATCACTGA TAGGGAGTGG TAAACTCGAC 24 0 

30 TTTCACTTTT CTCTATCACT GATAGGGAGT GGTAAACTCG ACTTTCACTT TTCTCTATCA 300 

CTGATAGGGA GTGGTAAACT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 360 

TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 420 

AAGACACCGG GACCGATCCA GCCTCCGCGG 450 



35 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Herpes Simplex Virus 

(B) STRAIN: KOS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GAGCTCGACT TTCACTTTTC TCTATCACTG ATAGGGAGTG GTAAACTCGA CTTTCACTTT 
TCTCTATCAC TGATAGGGAG TGGTAAACTC GACTTTCACT TTTCTCTATC ACTGATAGGG 
AGTGGTAAAC TCGACTTTCA CTTTTCTCTA TCACTGATAG GGAGTGGTAA ACTCGACTTT 
CACTTTTCTC TATCACTGAT AGGGAGTGGT AAACTCGACT TTCACTTTTC TCTATCACTG 
ATAGGGAGTG GTAAACTCGA CTTTCACTTT TCTCTATCAC TGATAGGGAG TGGTAAACTC 
GAGATCCGGC GAATTCGAAC ACGCAGATGC AGTCGGGGCG GCGCGGTCCG AGGTCCACTT 
CGCATATTAA GGTGACGCGT GTGGCCTCGA ACACCGAG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACTTTATCAC TGATAAACAA ACTTATCAGT GATAAAGA 38 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ACTCTATCAT TGATAGAGTT CCCTATCAGT GATAGAGA 



38 



# 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) Sequence characteristics: 

(A) LENGTH : 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGCTTATCAT CGATAAGCTA GTTTATCACA GTTAAATT 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ACTCTATCAT TGATAGGGAA CTCTATCAAT GATAGGGA 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AATCTATCAC TGATAGAGTA CCCTATCATC GATAGAGA 



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

5 ATG TCT AGA TT*V GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 48 
Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 96 
10 Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA AAA AAT AAG 144 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
15 35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 

Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 

50 55 60 

20 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 24 0 

Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 

65 70 75 80 

25 AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC GAT GGA 288 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
30 Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 384 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
35 115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 4 32 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

40 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 4 80 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

45 CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 576 
50 Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 621 
lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
55 195 200 205 



(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 4 8 
5 Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT' AAA CTC GCC CAG 96 
Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
10 20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA CTG TAT TGG CAT GTA AAA AAT AAG 144 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

15 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

20 ACT CAC TTT TGC CCT TTA AAA GGG GAA AGC TGG CAA GAT TTT TTA CGC 24 0 

Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC AAT GGA 288 
25 Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 

85 90 95 

GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
30 100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 3 84 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 

115 120 125 

35 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 432 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 

130 135 140 

40 GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 480 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 52 8 

45 Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 

165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 576 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
50 180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 621 

lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 

55 



(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GAC ATG GAA AAA GCG ACA CCG GAG ACG ATG GTC CAT TGG ATT TGT CTG 48 
Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp lie Cys Leu 
15 10 15 

AAG ATG GAG CCA GCT CTG TGG ATG GCC ATT ACA GCA ACA TCG CAC GGC 96 
Lys Met Glu Pro Ala Leu Trp Met Ala lie Thr Ala Thr Ser His Gly 
20 25 30 

GCA AGG CAC AGG ACA TTC GTC GGG TTT TCC GGC TGC CTC CAC CGC AAA 144 
Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 
35 40 45 

15 TCC CTC ACG TAC CCA GTG ATA TGC CTG AGC AAA CCG AGC CAG AGG ATT 192 
Ser Leu Thr Tyr Pro Val lie Cys Leu Ser Lys Pro Ser Gin Arg lie 
50 55 60 
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20 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp lie Cys Leu 
30 1 5 10 15 

Lys Met Glu Pro Ala Leu Trp Met Ala lie Thr Ala Thr Ser His Gly 
20 25 30 

35 Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 
35 40 45 

Ser Leu Thr Tyr Pro Val lie Cys Leu Ser Lys Pro Ser Gin Arg lie 

50 55 60 

40 

(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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CTG GAC GAC TCG AAG CGC GTA GCC AAG CGG AAG CTG ATC GAG GAG AAC 48 
Leu Asp Asp Ser Lys Arg Val Ala Lys Arg Lys Leu lie Glu Glu Asn 
15 10 15 

5 CGG GAG CGG CGA CGC AAG GAG GAG ATG ATC AAA -TCC CTG CAG CAC CGG 96 
Arg Glu Arg Arg Arg Lys Glu Glu Met lie Lys Ser Leu Gin His Arg 
20 25 30 

CCC AGC CCC AGC GCA GAG GAG TGG GAG CTG ATC CAC GTG GTG ACC GAG 144 
10 Pro Ser Pro Ser Ala Glu Glu Trp Glu Leu lie His Val Val Thr Glu 
35 40 45 

GCG CAC CGC AGC ACC AAC GCG CAG GGC AGC CAC TGG AAG CAG AGG AGG 192 
Ala His Arg Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin Arg Arg 
15 50 55 60 

AAA TTC CTG CTC GAA GAT ATC GGT CAG TCG CCC ATG GCC TCC ATG CTT 240 

Lys Phe Leu Leu Glu Asp lie Gly Gin Ser Pro Met Ala Ser Met Leu 
65 70 75 80 

20 

GAC GGG GAC AAA GTG GAC CTG GAG GCG TTC AGC GAG TTT ACA AAA ATC 288 

Asp Gly Asp Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys lie 
85 90 95 

25 ATC ACG CCG GCC ATC ACC CGC GTG GTC GAC TTT GCC AAA AAC CTG CCC 336 
lie Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Asn Leu Pro 
100 105 110 

ATG TTC TCG GAG CTG CCG TGC GAG GAT CAG ATC ATC CTG CTG AAG GGC 384 
30 Met Phe Ser Glu Leu Pro Cys Glu Asp Gin lie lie Leu Leu Lys Gly 
115 120 125 

TGC TGC ATG GAG ATC ATG TCG CTG CGC GCC GCC GTG CGC TAC GAC CCC 432 
Cys Cys Met Glu He Met Ser Leu Arg Ala Ala Val Arg Tyr Asp Pro 
35 130 135 140 

GAG AGC GAA ACG CTG ACG CTG AGC GGG GAA ATG GCC GTC AAA CGC GAG 480 

Glu Ser Glu Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Arg Glu 
145 150 155 160 

40 

CAG TTG AAG AAC GGA GGG CTG GGG GTC GTG TCT GAT GCC ATC TTC GAC 52 8 

Gin Leu Lys Asn Gly Gly Leu Gly Val Val Ser Asp Ala He Phe Asp 
165 170 175 

45 CTC GGC AAG TCG CTG TCT GCC TTC AAC CTG GAC GAC ACC GAG GTG GCC 576 
Leu Gly Lys Ser Leu Ser Ala Phe Asn Leu Asp Asp Thr Glu Val Ala 
180 185 190 

CTG CTG CAG GCC GTG CTG CTC ATG TCC TCA GAC CGG ACG GGG CTG ATC 624 
50 Leu Leu Gin Ala Val Leu Leu Met Ser Ser Asp Arg Thr Gly Leu He 
195 200 205 

TGC GTG GAT AAG ATA GAG AAG TGC CAG GAG TCG TAC CTG CTG GCG TTC 672 
Cys Val Asp Lys He Glu Lys Cys Gin Glu Ser Tyr Leu Leu Ala Phe 
55 210 215 220 

GAG CAC TAC ATC AAC TAC CGC AAA CAC AAC ATT CCC CAC TTC TGG TCC 720 
Glu His Tyr He Asn Tyr Arg Lys His Asn He Pro His Phe Trp Ser 
225 230 235 240 
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AAG CTG CTG ATG AAG GTG GCG GAC CTG CGC ATG ATC GGC GCC TAC CAC 768 
Lys Leu Leu Met Lys Val Ala Asp Leu Arg Met lie Gly Ala Tyr His 
245 250 255 

GCC AGC CGC TTC CTG CAC ATG AAG GTG GAG TGC CCC ACC GAG CTC TCC 816 
Ala Ser Arg Phe Leu His Met Lys Val Glu Cys Pro Thr Glu Leu Ser 
260 265 270 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



Leu Asp Asp Ser Lys Arg Val Ala Lys Arg Lys Leu lie Glu Glu Asn 
20 1 5 10 15 

Arg Glu Arg Arg Arg Lys Glu Glu Met lie Lys Ser Leu Gin His Arg 
20 25 30 

25 Pro Ser Pro Ser Ala Glu Glu Trp Glu Leu lie His Val Val Thr Glu 
35 40 45 



30 



45 



Ala His Arg Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin Arg Arg 
50 55 60 

Lys Phe Leu Leu Glu Asp lie Gly Gin Ser Pro Met Ala Ser Met Leu 
65 70 75 80 



Asp Gly Asp Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys lie 
35 85 90 95 

lie Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Asn Leu Pro 
100 105 110 

40 Met Phe Ser Glu Leu Pro Cys Glu Asp Gin lie lie Leu Leu Lys Gly 
115 120 125 



Cys Cys Met Glu lie Met Ser Leu Arg Ala Ala Val Arg Tyr Asp Pro 

130 135 140 

Glu Ser Glu Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Arg Glu 
145 150 155 160 



Gin Leu Lys Asn Gly Gly Leu Gly Val Val Ser Asp Ala lie Phe Asp 
50 165 170 175 

Leu Gly Lys Ser Leu Ser Ala Phe Asn Leu Asp Asp Thr Glu Val Ala 
180 185 190 

55 Leu Leu Gin Ala Val Leu Leu Met Ser Ser Asp Arg Thr Gly Leu lie 



195 

Cys Val Asp Lys 
210 

Glu His Tyr lie 
225 

Lys Leu Leu Met 



Ala Ser Arg Phe 
260 




200 

lie Glu Lys Cys 
215 

Asn Tyr Arg Lys 
230 

Lys Val Ala Asp 
245 

Leu His Met Lys 
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Gin Glu Ser Tyr 
220 

His Asn lie Pro 
235 

Leu Arg Met lie 
250 

Val Glu Cys Pro 
265 




205 

Leu Leu Ala Phe 

His Phe Tr^ Ser 
240 

Gly Ala Tyr His 

255 

Thr Glu Leu Ser 
270 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TCCCCGGGTA ACTAAGTAAG GATCC 2 5 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AGTGGGTCCC CGGGTGACAT GGAA 24 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
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Ser Gly Ser Pro Gly Asp Met Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 27 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
AGTGGGTCCC CGGGTCTGGA CGAC 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
Ser Gly Ser Pro Gly Leu Asp Asp 



